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Accurate energy ranking of molecular crystals using DFT 
calculations and empirical Van der Waals potentials* 

1 Introduction and industrial context 

The present invention refers to a method for the accurate determination of van der Waals 
parameters used in conjunction with density functional theory calculations for high-precision 
determination of crystal structures and/or energies, and based on the thus determined accurate 
parameters, a method for the accurate determination of crystal structures and/or energies, in 
particular in view of the energy ranking of polymorphic crystal structures. 

The physical and chemical properties of molecular compounds in the crystalline state strongly 
depend on the precise nature of the molecular arrangement. For most molecules, several 
crystal polymorphs can be observed experimentally that may differ significantly with respect 
to their density, hardness, crystal habit, colour, solubility, dissolution rate and other important 
properties. Therefore, the characterization and the control of polymorphism are of prime 
importance for various industrial applications such as the production of explosives, pigments 
and food products. In the pharmaceuticals industry, strict regulatory rules apply to make sure 
that only well-defined polymorphic forms are brought to market, since changes of the 
solubility and the dissolution rate affect the bioavailability of a drug compound. One 
particular problem for the pharmaceuticals industry is the fact that the most stable polymorph 
may have very unfavourable crystallization kinetics. It may therefore happen that the most 
stable polymorph appears accidentally many years after the first crystallization experiments, 
at a point when clinical trials and production scale-up have already been carried out for a 
meta-stable polymorph. Once the production site has been contaminated with the most stable 
polymorph, the controlled crystallization of the desired polymorph may turn out to be 
impossible. This problem, known as the phenomenon of disappearing polymorphs, may cause 
serious delays and losses of revenue [Dunitz & Bernstein, 1995], 

The reasons outlined above have triggered a substantial amount of research in the field of in 
silica polymorph screening, commonly called polymorph prediction, a technique aiming at the 
prediction of the experimentally accessible polymorphs of a given molecule by computer 
simulation. The state-of-the-art is illustrated by the results of two blind tests organized by the 
Cambridge Crystallographic Data Center [Lonunerse 2000; Motherwell 2002]. In silico 
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polymorph prediction can be roughly divided into three steps: structure generation, energy 
ranking and the prediction of crystallization kinetics. Structure generation with a tool like 
Accelrys' Polymorph Predictor [Accelrys 2004] is usually quite reliable for rigid and slightly 
flexible molecules. Energy ranking, on the contrary, remains an important challenge. Many 
different low energy crystal structures are typically found in an energy window of 1 kcal/mol 
and the current accuracy of lattice energy calculations is not high enough to allow for the 
identification of the most stable polymorphs. The prediction of crystallization kinetics,, i.e. the 
identification of the crystal polymorph that is the easiest to crystallize, is completely beyond 
reach. 

The current inability to pin-point the experimentally observable crystal structures in a. list of 
generated crystal structures is a serious drawback for industrial applications. Since different 
polymorphs, even if close in energy, may have very different properties, in silico polymorph 
prediction is virtually useless as long as it remains impossible to identify those polymorphs 
that can be crystallized. In the past, in silico polymorph screening has sometimes been used 
for crystal structure solution in conjunction with low quality powder diffraction data, "but this 
approach has now been replaced by more powerful methods for direct crystal structure 
solution from powder diffraction data. 

In this document we describe a new method for accurate lattice energy calculations tttat will, 
in general, allow for the identification of the most stable crystal polymorph within a list of 
candidates. The new method will have a strong impact on the application of in silico 
polymorph screening to industrial problems. Having predicted the most stable polymorph by 
computer simulation, it will virtually always be possible to find the right conditions to 
crystallize it. This means that for every molecule, even if it has never been synthesized, one 
obtainable crystal polymorph with its physical and chemical properties can be identified by 
computer simulation. It will thus become possible to use computational tools in order to 
design new materials with the desired colour (pigments), solubility (pharmaceuticals), 
hyperpolarisability (laser technology) and other wanted features. In the pharmaceuticads area, 
the new method will be used to identify and crystallize the most stable polymorphic form, 
thus avoiding the problem of disappearing polymorphs. 

2 Prior art 
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Current energy calculations can be roughly divided into three categories: ab initio 
calculations, semi-empirical calculations and force field calculations (see [Wimmer 2004] for 
a basic introduction and [Leach 2001] for details): 

Ab initio calculations are the most accurate and the most CPU-time consuming. They take 
into account the quantum nature of the electronic movement around the atomic nuclei. To 
calculate the energy for a given set of atomic positions, it is necessary so solve Schodinger's 
equation for the electronic motion. The analytical solution of Schodinger's equation is only 
possible in the simplest cases (hydrogen atom, etc). In general, the solution of Schodinger's 
equation involves a certain number of approximations and is carried out numerically. 
According to the nature of the approximations, ab initio calculations can be subdivided into 
three big families: Hartree-Fock (HF) calculations, Density Functional Theory (DFT) 
calculations and Quantum Monte Carlo (QMC) simulations [Foulkes et al 2001]. 

Basic HF calculations are not very accurate because of the neglect of electron correlation. 
However, augmented by perturbation theory or configuration interaction calculations, HF 
calculations offer a very high accuracy. In theory, the accuracy of this type of calculation is 
only limited by the available CPU resources. In practice, the CPU time requirements increase 
very rapidly with the system size (= number of atoms and electrons) and they are therefore 
inappropriate for the energpranking of molecular crystals. va 

In DFT calculations, an approximation is used for the treatment of electron correlation effects. 
At comparable accuracy, DFT calculations are usually faster than HF calculations and scale 
more favourably with system size. DFT calculations on crystal structures can be carried out 
on a state-of-the-art PC and offer satisfying results for framework structures (zeolithes, etc) 
and ionic crystals. However, because of the approximate treatment of electron correlation 
effects, the accuracy of DFT calculations is limited. In particular, the long range Van der 
Waals (VdW) interactions (also called dispersive interactions) between neutral (or charged) 
atoms are not taken into account. Van der Waals interactions play an important role in 
molecular crystals, and pure DFT calculations for molecular crystals are therefore not very 
accurate. 

QMC simulations are highly accurate but require very long calculation times. They are 
therefore limited to systems containing no more than a few atoms. 
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Semi-empirical calculations are related to ah initio calculations. They can be derived by 
introducing further approximations and adjustable parameters. The adjustable parameters 
need to be calibrated with respect to experimental data or high level ab initio calculations. 
Semi-empirical methods are currently not accurate enough for the accurate energy ranking of 
molecular crystals. 

Force fields are purely empirical in nature and use a set of functional forms and parameters to 
describe the potential energy as a function of the atomic coordinates. If a force field is 
carefully parameterized for a particular molecule or a class of related molecules, force field 
calculations can turn out to be fairly accurate and are sometimes appropriate for the accurate 
energy ranking of simple molecular crystals. For most molecules of industrial interest, 
however, well parameterized force fields are not readily available. Force field parameters are 
generally adjusted for a limited set of rather small molecules and it is assumed that they can 
be transferred to larger molecule. This parameter transfer i reduces energy errors that are 
incompatible with the needs of in silico polymorph screening. Ira. addition, today's force fields 
perform poorly for systems with strong electrostatic interactior*_s such as molecular salts and 
zwitterions (e. g. glycine) that play an important role in pharmaceutical applications. 

In summary, it can be said that — within the current memory and CPU time constraints - none 
of the methods mentioned above offers the accuracy requijred for in silico polymorph 
screening. 

In recent years, several attempts have been made to increase the accuracy of DFT calculations 
by adding empirical potential energy functions that model the long range dispersive 
interactions [Elstner et al 2003; Wu and Yang 2002; Liu et al 2O01; Elstner et al 2001]. In all 
these approaches, an additional potential energy term of the following type is used: 

E disp = A*(^)% yL (Eq. 2.a) 

Here A and B run over all pairs of interacting atoms, r AjB is the distance between the 
interacting atoms, f^ B is a damping function and C 6 ,a,b is a co&ifficient that characterizes the 
strengths of the interaction. The -Ce/r 6 term captures the well kaown asymptotic behaviour of 
dispersive interactions at large interatomic distances. To avoid the unrealistic divergence of 
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the empirical potentials at short interatomic distances, a damping functions f^B is used. The 
C6AB interaction coefficients can be determined from experimental data or high level ab initio 
calculations. 

From now on, we will call a method that combines DFT calculations with empirical Van der 
Waals potentials a hybrid method. In the scientific literature [Elstner et al 2003; Wu and Yang 
2002; Liu et al 2001 ; Elstner et al 2001] the accuracy of the existing hybrid methods has been 
assessed by comparison with high level HF calculations. Disagreements seem to be of the 
order of 1 kcal/mol, which is far too much for the accurate energy ranking of crystal 
structures. The published methods have been applied to rare-gas diatomic molecules, the 
stacking of base pairs, polyalanines conformation stabilities and protein modelling. 

In the hybrid method proposed in the prior art, only the C6,a,b coefficient is numerically 
adjusted based on reference data which describe long-range molecular interactions, taking 
advantage of the fact that on a large scale, the interaction between molecules is dominated by 
a van der Waals potential, whereas contributions expressed by DFT calculations are 
negligible. As a consequence, such calculations are unsuitable for numerically simulating 
crystal structures, i.e. molecular arrangements on a geometrical scale at which DFT 
contributions and van der Waals contributions have the same order of magnitude. 

w 

It is therefore an object of the invention to propose a method for the accurate determination of 
van der Waals parameters which is suitable for the high-precision determination of crystal 
structures and/or energies. 

According to the invention, this object is achieved by a method as defined in claim 1. 
Advantageous embodiments are defined in dependent claims 2 to 9. 

A particularly efficient numerical optimization method using an advantageous crystal 
coordinate system is defined in claim 10. 

A general advantageous method for the energy ranking of polymorphic crystal structures 
based on a hybrid method is defined in claim 11. 
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In this document we describe an improvement of the hybrid approach that makes it suitable 
for the accurate energy ranking of crystal structures. At present, the scientific corxirnunity has 
not realized that for highly accurate calculations the careful adjustment of tlie damping 
function f is as important as the determination of appropriate Ce coefficients. Int addition, it 
has not been realized that hybrid methods with properly adjusted empirical parameters offers 
the accuracy required for the accurate energy ranking of crystal structures in the context of in 
silico polymorph screening. 

It is important to note that the term 'hybrid method' is often employed in a different context. 
To study complex phenomena such as protein ligand interactions, complex systems are often 
subdivided into two subsystems. For instance, the smaller subsystem may consist of the active 
site of the protein and a ligand, while the larger subsystem contains the backbone of the 
protein and the solvent For the smaller subsystem, the potential energy is determined by high 
level ab initio calculations, while a force field is used to describe the interactions between the 
two subsystems and within the larger subsystem. The case just described is different from the 
hybrid method described in this document, where both the high level DFT calculations and 
the empirical potential energy terms apply to all atoms in the system. 

The hybrid method described in this document provides the potential energy as a function of 
the unit cell parameters and the atomic positions in the unit cell. The crystal structures 
observed in nature approximately correspond to local minima of the potential energy as a 
function of the structural parameters. Lattice energy minimization, that is the determination of 
the structural parameters that correspond to the lowest value of the potential energy, is 
therefore a key step for the accurate energy ranking of crystal structures. As we will see later, 
lattice energy minimization is also a key step in the refinement of the empirical parameters 
used with the hybrid method. Since energy calculations with the hybrid method are very time 
consuming, a new efficient lattice energy minimization algorithm for molecular crystals has 
been developed and is described in this document. We therefore briefly review the state of the 
art in the field of lattice energy minimization. 

In principle, lattice energy minimization is nothing else but the minimization of a function 
with respect to its variables. Seen from this angle, lattice energy minimization is a standard 
task (see [Press et al 2002] for more information on standard minimization algorithms). 



-6- 



WO 2005/091178 



PCT/EP2005/050778 



However, the efficiency of the lattice energy minimization procedure is strongly related to the== 
choice of the coordinate system used to describe the crystal structure. Depending on the^ 
coordinate system, the potential energy surface around the minimum is more or less harmonio 
and most optimization algorithms work best on harmonic potential energy surfaces. 

A straight forward choice is to define a crystal in terms of its space group, its lattice 
parameters (or direction cosines or other related variables) and the positions (in fractional or 
Cartesian coordinates) of the atoms in the asymmetric unit. The lattice energy minimization, 
algorithms in DFT codes such as VASP [Kresse and Hafiier 1993 & 1994, Kresse and 
Furthmiiller 1996 & 1996b, Kresse and Jouber 1999] and CASTEP [Milman et al 2000] work 
along these lines. Fractional/Cartesian coordinates are an appropriate choice for many 
inorganic crystals, but they turn out to be very inefficient for molecular crystals. For isolated 
molecules, it has been shown that so called internal delocalized coordinates [Baker 1997; 
Baker et al 1996; ] perform significantly better than Cartesian coordinates. A similar approach 
has been shown to work for crystal structures of 3D covalently bonded networks [Andzelm et 
al 2001] and has been implemented in the DFT code DMol3 [Delley 1990; Delley 2000]. 
However, the unit cell was not allowed to vary and the approach has not been combined with 
space group symmetry. Details about the implementation are not readily available, but it is 
known that the program can also cope with molecular crystals. This may be achieved by the 
introduction of fictive additional bonds between molecules to create a 3D bonded network 
The use of delocalized internal coordinates for the optimization of periodic systems with a 3D 
network of covalent bonds has also been reported by a second group [Kudin et al 2001] . In 
this case, the unit cell is allowed to vary, but once again the approach is not combined with 
space group symmetry (only the point group of the unit cell is taken into account). Molecular 
crystals without a 3D network of covalent bonds are dealt with by introducing additional 
bonds, for instance the hydrogen bonds between molecules. 

Both of the above mentioned generalizations of the concept of delocalized internal 
coordinates to the case of molecular crystals have the disadvantage that the introduction of 
additional bonds is somewhat arbitrary and that there is no clear distinction between 
intramolecular and intermolecular degrees of freedom. In general, intramolecular interactions 
are much stronger than intermolecular interactions, and it is therefore desirable to decouple 
the intramolecular and the intermolecular degrees of freedom. In addition, both 
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generalizations are not combined with a full treatment of space group symmetry, thus 
increasing significantly the number of parameters to be refined. 

In this document, we describe a coordinate system based on internal delocalized coordinates 
(for intramolecular degrees of freedom), whole molecule translations and rotations and unit 
cell changes. We also show how this coordinate system can be combined with space-group 
symmetry. The new coordinate system is the 'natural choice' for molecular crystals, but has, 
according to our knowledge, never been implemented. Probably because of the fairly 
complicated coordinate transformations that are involved* 

3 Main embodiments of the invention 

3.1 A highly accurate hybrid method for lattice energy calculations 

The invention provides a hybrid method for the calculation of lattice energies and forces (= 
energy derivatives) in molecular crystals that is - for given CPU time requirements and 
system size - more accurate than any other method currently available. 

The invention al&o provides a recipe for the determination of the empirical parameters used 
with the hybrid method. The accuracy of the hybrid method strongly depends on the 
refinement of appropriate empirical parameters. 

3.2 A method for the efficient lattice energy minimization of molecular crystals 

As part of the invention, a more efficient method for the crystal structure optimization of 
molecular crystals has been developed. 

3-3 A method for the accurate energy ranking of molecular crystals 

The invention provides a method that is suitable, unlike any other method., for the reliable 
energy ranking of the various polymorphic crystal structures of one and the same molecule. 
Accurate energy ranking is essential in the context of in silico polymorph screening. 



-8- 



WO 2005/091178 



PCT/EP2005/050778 



These embodiments are closely linked and can be combined with each other. 
4 Detailed description of preferred embodiments 

Preferred embodiments of the invention will be described in detail with reference to the 
accompanying drawings, in which: 

Fig. 1 shows a schematic representation of the interaction energy between two atoms, 
illustrating that to yield the correct interaction energy, the DFT calculation and the additional 
VdW energy need to be complementary. 

Fig. 2 shows the distance dependence of the empirical pair potential calculated for the y- 
polymorph of N 2 . 

Fig. 3 illustrates the influence of the form factor nby showing fA.B(r)*r 6 for three different 
values of the form factor with r A ,B=3.0. The smaller the form factor, the harder the transition 
from the long range r" 6 part to the constant short range part. 

w Fig. 4 shows internal coordinates and Cartesian coordinates fortwater. 

Fig. 5 shows contour plots of the potential energy surface for different definitions of the 
dimensionless parameters qi and q 2 , wherein contour lines represent 10 kcal/mol intervals. 

Fig. 6 shows degrees of freedom describing atomic displacements in the unit cell: 1) Whole 
molecule translations, 2) whole molecule rotations, 3) intramolecular coordinate changes, the 
two molecules in the unit cell being symmetry related. 

Fig. 7 shows the molecular centres following the deformation of the lattice, while the 
molecular geometry and the molecular orientation remain unchanged. 

Fig- 8 illustrates the definition of a bond length, a bond angle and a torsion angle. 

Fig- 9 shows three atoms on a straight line. 
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Fig. 1 0 shows a flow diagram for parameter refinement. 

Fig. 1 1 shows the test set of molecules used for the energy ranking study, comprising from 
left to right: ethane, ethylene, acetylene, methanol, acetic acid, urea. 

Fig. 12 compares the energy ranking of the Polymorph Predictor (UFF with Qeq charges) to 
the energy ranking obtained with the hybrid method according to the invention. 

Fig, 13 shows the three most stable crystal structures of acetylene found with the hybrid 
method according to the invention; for rank 1 and rank 3, a superposition with the 
experimental crystal structure is presented. 

Fig. 14 shows a comparison of the experimental and calculated crystal structures of a-N2 
(left) and y-N 2 (right) for three different calculations, each of the 6 graphics being a 
superposition of two crystal structures. 

Fig. 15 shows a superposition of the calculated and the experimental crystal structure of s- N2. 
Fig. 16 shows the relative enthalpy as a function of pressure for oc-N 2 , y-N 2 and S-N2. 

4.1 A highly accurate hybrid method for lattice energy calculations 

Lattice energies are calculated by a combination of high level DFT calculations and additional 
empirical potentials that model long range dispersive interactions. To achieve optimum 
complementarily between the DFT part and the empirical part of the calculations, the hybrid 
method is used to calculate measurable properties and some or all of the empirical parameters 
are adjusted to reproduce experimental data. The DFT part may involve certain options and all 
choices have to be made such that best complementarity with the empirical potentials is 
obtained. 
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Let us consider the interaction between two neutral atoms (See Fig. 1). Fig. 1 shows a 
schematic representation of the interaction energy between two atoms, illustrating that to yield 
the correct interaction energy, the DFT calculation and the additional VdW energy need to be 
complementary. At large interatomic distances, the contribution of the DFT calculation to the 
total energy is zero, while the additional empirical VdW potentials follow a C6/r 6 -dependence. 
Since the DFT calculation does not contribute to the total energy, the C 6 coefficients can be 
determined independently, i.e. they do not depend on the DFT calculation. At short 
interatomic distances, there may be some contribution from the empirical potentials, but the 
total interaction energy is dominated by the DFT contribution and the contribution from the 
empirical potentials is negligible. 

At intermediate distances, the situation is very different. Both, the DFT contribution and the 
VdW contribution are equally important, and the correct interaction energy is only obtained if 
the DFT calculation and the additional empirical potentials are complementary like the two 
curves DFT1 and VdWl and the two curves DFT2 and VDW2 in Fig. 1. The contribution of 
the DFT calculation at intermediate distances depends on various factors such as the basis set 
and the exchange-correlation functional. As the DFT contribution and the VdW contribution 
are interrelated, the additional empirical potentials need to be carefully adjusted in the range 
of intermediate distances. This can be achieved by the appropriate choice of the damping 
function fA s B (see Eq. 2.a). 

In previous work, only little attention has been paid to the damping function and the 
parameters involved seem to have been chosen rather ad hoc. This may have been motivated 
by the fact that there are much more atom pairs at large interatomic distances than at short and 
intermediate interatomic distances. However, for a given atom, the number of interacting 
partners increases like t 2, with the interatomic distance while the interaction energy decreases 
like r" 6 . The total interaction energy as a function of the interatomic distance therefore 
decreases like r 4 . In other words, most of the interaction energy is concentrated at 
intermediate atomic distances. This fact is further illustrated by Fig. 2. Fig. 2 shows the 
distance dependence of the empirical pair potential calculated for the y-polymorph of N^. The 
figure shows the C 6 /r 6 part of the N-N interaction (dotted curve) and the damped interaction 
energy (dashed curve) according to Wu and Yang [Wu and Yang 2002]. The dash-dotted 
curve represents the damped interaction energy that is obtained if the Rm parameter in the 
damping function of Wu and Yang [Wu and Yang 2002] is adjusted to yield best results in 
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conjunction with the DFT calculations that axe described in section 5.2. The solid curve shows 
a history plot of the VdW interaction energy as a function of the interatomic distance 
calculated for the y-polymorph of N2- The interaction energy is clearly concentrated at short 
distances and is significantly affected by the damping function. The use of the damping 
function proposed by Wu and Yang [Wu and Yang 2002] without parameter adjustment 
would have lead to an overestimation of the total interaction energy (see also section 5.7 and 
Fig. 14 in particular). 

To achieve high accuracy, the adjustment of the damping function is of crucial importance. 
We have already pointed out that the damping function to use depends on the exact nature of 
the DFT calculation and more specifically on the choice of the exchange-correlation 
functional and the basis set. Because of the later dependence, damping functions are probably 
not transferable between hybrid methods that use localized atomic orbitals and hybrid 
methods that use 3D periodic basis functions. 

In this work we are specifically interested in damping functions for hybrid methods that can 
cope with 3D periodic boundary conditions. To adjust the damping functions, experimental 
data are required that can be accurately measured and that are straight forward to calculate. 
Low temperature crystal structures are ideal for this purpose. At low temperature, the average 
atomic positions and unit cell parameters measured by a diffraction experiment correspond 
fairly accurately to the minimum of the potential energy hyper surface, and the minimum 
energy crystal structure is straight forward to calculate using the hybrid method in conjunction 
with a lattice energy minimizer. Appropriate damping functions can thus be determined by 
seeking the best possible agreement between two sets of calculated and experimental crystal 
structures. 

The description of long range dispersive interactions as a sum over isotropic pair potentials is 
only an approximation. In reality, there are also contributions from many body interactions, 
i.e. interactions involving more than two atoms. Since we adjust the empirical pair potentials 
to experimental data, it can be expected that effects such as many body interactions are to a 
certain extend reflected by the obtained parameters. 

Ultimately we Eire interested in the accurate energy ranking of crystal structures and not in the 
calculation of accurate crystal structures. It is not obvious that damping functions adjusted 
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toward structural data are also the best possible choice for the comparison of lattice energies. 
One may argue, however, that crystal structures result from a fine balance between different 
types of interactions. Since electrostatic interactions are described fairly accurately by DFT 
calculations, one may conclude that the VdW component of the lattice energy is dealt with 
accurately if the experimental crystal structures are accurately reproduced. Reliable 
experimental energetic information is only available for very few compounds. In some cases, 
the energy ranking of different polymorphs at low temperature is known experimentally and 
this type of information may be included in the refinement. In addition, one may assume that 
simple, ball-like molecules with essentially isotropic intermolecular interactions always 
manage to reach the most stable crystal structure at low temperature. In that case, the 
experimental low energy crystal structure , must be lower in energy than any other structure 
generated with a tool like Accelrys^ Polymorph Predictor [Accelrys 2004]. This type of 
information may also be included in the refinement. 

So far we have focused on the adjustment of the damping function. If enough experimental 
data is available, one may also refine the Ce coefficients or use more sophisticated 
descriptions of long range dispersive interactions that may involve anisotropic pair potentials 
or many body effects. 

In an- improved embodiment, for a more accurate parameter refinement, the effect of lattice 
vibrations on the average cell parameters, the average atomic positions and the free lattice 
energy may be taken into account at the cost of significantly longer calculation times. 

Additional empirical potential energy terms (hydrogen bond potentials, conformational 
energies) unrelated to long range dispersive interactions may be added to the hybrid method 
to correct for failures of the hybrid method that may have been identifies by the comparison 
of experimental and calculated low energy crystal structures. 

The empirical parameters used with the hybrid method may be adjusted to theoretical rather 
than experimental data. Theoretical data, for example energies, forces and pressure 
components, may be calculated for a set of crystal structures using high level HF calculations 
(with corrections taking into account electron correlation effects) or quantum Monte Carlo 
techniques. Compared to the use of experimental data, this approach has advantages and 
disadvantages. On the one hand, theoretical data may be useful in cases where experimental 
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data are not readily available. In addition, theoretical data are not affected by effects such as 
zero-point vibrations which are difficult to take into account in the refinement procedure. On 
the other hand, the accuracy of theoretical data is affected by the limitations of the 
corresponding calculations and there may be significant deviations from reality. Therefore, 
refinement with respect to experimental data has been preferred in this work. 

4.2 A method for the efficient lattice energy minimization of molecular crystals 

An advantageous feature is the choice of a coordinate system that uses delocalized internal 
coordinates for the intramolecular degrees of freedom and whole molecule translations and 
rotations for the intermolecular degrees of freedom. Lattice changes are described in terms of 
deformations of the starting lattice without any rotational component. Upon lattice changes, 
the fractional coordinates of the molecular centres (of mass, of volume or other) remain 
constant and no rotation of the molecules is induced. Space group symmetry is fully taken 
into account 

The choice of coordinates provides good decoupling of the various degrees of freedom 
involved. The use of delocalized internal coordinates reflects the chemical connectivity and 
takes into account the inherent curvilinear nature of intramolecular structure changes. The use 
of whole molecule translations and rotations allows for whole molecule displacements 
without strong changes of the intramolecular forces. The treatment of lattice changes 
described above insures that lattice changes affect as little as possible the net force and the net 
torque acting on the molecules. Full space group symmetry is required to work with the 
smallest possible number of degrees of freedom. 

The lattice energy and its derivatives are typically calculated using a simple coordinate system 
(called the fractional coordinate system in this document) based on lattice parameters and 
fractional atomic coordinates. In comparison to the fractional coordinate system, the natural 
coordinate system involves time consuming additional coordinate transformations. Therefore, 
the natural coordinate system is a particularly good choice for lattice energy minimization if 
the computational cost of the additional transformations is small compared to the 
computational cost of the avoided energy calculations. This is definitely the case when lattice 
energies are calculated with the hybrid method. 
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Lattice changes can be described in many different ways. Some of these changes may involve 
a rotational component. If such a coordinate change is accompanied by a corresponding 
rotation of all molecules that compensates for the rotation induced by the lattice change, the 
situation is equivalent to the one described above. 

4.3 A method for the accurate energy ranking of molecular crystals 

For each crystal in a list of polymorphic crystal structures, the corresponding local minimum 
on the lattice energy hyper surface can be determined using a lattice energy minimizer in 
conjunction with a hybrid method that combines DFT calculations and empirical potentials 
for the description of long range dispersive interactions. The cry stal structures are ranked with 
respect to the lattice energy at the local minima. 

To be able to identify the most stable crystal structure in a list of crystal structures with a 
reasonable amount of certainty, lattice energies need to be calculated with an accuracy that 
exceeds typical energy differences obtained for the structures in the list. The hybrid method 
described in 4.1 provides this accuracy, a fact that the scientific community is currently not 
aware of. 

The lattice energy at the minima of the potential energy hyper surface is only an approximate 
indicator for the relative stability of polymorphic crystal structures. In reality, the relative 
stability is related to the free lattice energy that depends on the lattice energy at the minimum, 
the energy levels of the lattice vibrations and the temperature. Even at vanishing temperature, 
the relative stability is not exactly described by the lattice energy minima alone because of the 
presence of zero-point vibrations, 

A more accurate energy ranking could be obtained if the hybrid method was coupled with a 
free lattice energy minimizer rather than a lattice energy minimizer. Alternatively, one may 
calculate a free lattice energy correction after lattice energy minimization. However, because 
of the vibrational component, the calculation of free lattice energies is significantly more time 
consuming than the calculation of lattice energies. 

Alternatively, in a first step, one may parameterize a force field or a semi-empirical method 
using the energies and energy derivatives determined with the hybrid method. Using this force 
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field or semi-empirical method* the energy ranking may be carried out in a second step. It is 
thus possible to avoid direct energy ranking using the hybrid method. Since the hybrid method 
is fairly memory and CPU time expensive, such a two step approach would help to save CPU 
time or allow calculations for bigger crystal structures. The increased speed of the 
calculations would also facilitate the calculation of free lattice energies instead of lattice 
energies. The high precision force field or semi-empirical method may also be used in other 
areas such as the prediction of crystal morphologies or the computer simulation of liquids. 

5 Mathematical details and their application 

The knowledge of the potential energy E pot (co, c nc , v, , ... vjasa function of the unit cell 
parameters c* and the atomic positions is the key to the calculation of virtually all physical 
and chemical properties. The variables Ci and v. will be defined more precisely in section 5.1. 
Within the framework of the hybrid method, the potential energy E pot is separated into two 
components, a DFT part and a Van der Waals part: 

Epot=EDFT+Edi Sp (Eq. 5.a) 

The calculation of E DF t(co, c nc , v, 5 ... v nv } and its first derivatives is described in section 
5.2, while septipn 5.3 is devoted to the determination of Edi sp (co, c nc , v l9 ... f .# OT ) and its 
first derivatives. 

In the current embodiment of the invention, E pol is used conjunction with a lattice energy 
minimizer to determine local minima on the lattice energy hyper surface. For the purpose of 
lattice energy minimization, the fractional coordinate system (c 0? . .., c nC5 Vj , ... v ttv ) is not the 
ideal choice. In section 5.4, we describe so called natural coordinate system that is based on 
lattice changes Aj, whole molecule translations Ty, whole molecule rotations py and 
delocalized internal coordinates 0y. Here the first index runs over all coordinates in a 
molecule and the second index specifies the molecule. In section 5.4, we also discuss how the 
lattice energy and its first derivatives can be obtained in the natural coordinate system. 
Knowing E po t(A,o, 7^, x u , ... x nt i 9 i, pij, ... p np i,i, ©ij, ... 0 n ei 3 iji,2, .-•) and its first 
derivatives, lattice energy minimization can be carried out easily using slightly adapted 
standard optimization algorithms such as the conjugate gradient technique or quasi-Newton 
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methods (see [Press et al 2002] for details). Since these techniques are well documented in 
the scientific literature, they are not further discussed. 

The empirical energy term E V aw contains parameters that need to be adjusted to experimental 
data. The parameter refinement procedure is described in section 5.5 and the application of 
the hybrid method to the energy ranking of crystal structures is discussed in section 5.6. 
Sections 5.5 and 5.6 both present a variety of validation results which demonstrate that the 
hybrid methods, with empirical parameters fitted to low temperature crystal structures, is 
appropriate for the accurate energy ranking of crystal structures. To further illustrate this fact 
by a simple example, section 5.7 presents a validation study on the polymorphism of N 2 . 

In this work, wc mainly concentrate on the calculation of lattice energies. Many of the known 
crystal structures of N2 have been measured under high pressure. The pressure dependence 
can easily be taken into account by calculating the lattice enthalpy rather than the lattice 
energy, i.e. by adding an enthalpy term pV ce ii to (Eq. 5. a). Here p is the constant, isotropic 
pressure and V ce ii is the unit cell volume. The determination of V ce ii and its first derivatives is 
discussed in subsection 5.3.2. 

5.1 Lattice parameters, fractional coordinates and symmetry 

For the purpose of lattice energy calculation, we define the crystal structure in terms of nc unit 
cell parameters c» and nv atomic positions^ . 

The unit cell is entirely defined by the unit cell lengths a, b, c and the unit cell angles a, (3, y. 
The number of independent, variable cell parameters C\ depends on the crystal symmetry, or, 
to be more precise, on the crystal lattice. The table below presents the definition of the 
variable cell parameters c,- for the seven crystal systems. 



Tab. 5.1. a: Relationship between the unit cell parameters a, b, c, a, J3, y and the nc variable 
ceil parameters c*: 



Crystal system 


nc 


a = 


b = 


c = 


a = 


P = 


Y = 


triclinic 


6 


cl 


c2 


c3 


c4 


c5 


c6 


monoclinic 


4 


cl 


c2 


c3 


90° 


c4 


90° 


orthorhombic 


3 


cl 


c2 


c3 


90° 


90° 


90° 
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tetragonal 


2 


cl 


cl 


c2 


90° 


90° 


90° 


hexagonal 


2 


cl 


cl 


c2 


90° 


90° 


120° 


trigonal 


2 


cl 


cl 


cl 


c2 


c2 


c2 


cubic 


1 


cl 


cl 


cl 


90° 


90° 


90° 



The unit cell parameters define the lengths of and the angles between the unit cell vectors a , 

b and c . For the orientation of the unit cell with respect to an external Cartesian coordinate 

system we use the convention that a is parallel to the x-axis while b lies in the x-y plane and 
has a positive b y component. With these definitions, the unit cell vectors are related to the unit 
cell parameters as follows: 





(a ^ 

X 




(a) 








a = 


a y 




0 






(Eq. 5.1. a) 
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(Eq. 5.1.b) 
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(Eq. 5.1.c) 
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The lattice vectors a , b and c are the columns of the transformation matrix L from 
fractional atomic coordinates to Cartesian atomic coordinates: 

L = (3,£,£) (Eq. 5AA) 

Xcart^LXfraa (Eq. S.l.e) 

At different occasions, we will require the derivatives dL I dc i of the transformation matrix L 

with respect to the variable lattice parameters c\. We do not specify these derivatives 
explicitly, as they are easily obtained for each crystal system by the application of standard 
derivation rules after the replacement of the unit cell parameters a, b, c, a 7 (3 and y in (Eq. 
5. La) - (Eq. 5.Lc) by the values and parameters indicated in Tab. 5. 1 .a. 

The positions of the atoms in the asymmetric unit are given by nv vectors v t . These 
coordinate vectors are related to fractional atomic coordinates by the following equation: 



- 18- 



WO 2005/091178 



PCT/EP2005/050778 



x 



fractj 



— W i v i 4- w i 



(Eq. 5,1 J) 



Here W i is a 3x3 matrix and w. is a vector. For atoms on a general position, W l is the identity 

matrix and w s is zero so that x fraciJ and v i are identical. For atoms on special positions, the 

situation is a little bit different Because of symmetry constraints, atoms on special positions 
cannot move freely in all three dimensions. There are tree different situations: 

1) The atom cannot move at all: We do not require a vector v, to describe its positions. 

2) The atom can move along a line: The first column of the matrix W i is a unit vector 
along this line, completed by two additional directions such that W i is an orthonormal 
matrix. Only the first component of v t is meaningful, the other two components are 
always set to zero. w { is a reference position on the line and may be chosen to be the 
starting position. 

3) The atom can move along a plane: The first two column of the matrix W) are 
orthogonal unit vectors that lie in the plane completed by one additional direction such 
that W t is an orthonormal matrix. Only the first two components of v t - axe meaningful, 
the last component is always set to zero. w f is a reference position on the plane and 



may be chosen to be the starting position. 

Each atom i in the asymmetric unit has mi symmetry copies in the unit cell. The number of 
symmetry copies m{ is called the multiplicity. In general, a symmetry operation consists of a 
3x3 matrix S and a displacement? . A list of all symmetry operations for each space can be 
found in the scientific literature [Hahn 2002]. The number of symmetry copies is not the same 
for all atoms in the asymmetric unit. Atoms on special positions have fewer symmetry copies 
than atoms on general positions and there are different types of special positions. Each set of 
symmetry operations contains at least the identity operation. For each atom i, there is a set of 
mi symmetry operations that generates all symmetry copies in the unit cell, and the j-th 
symmetry copy is given by the following expression: 




— S.- fX 




(Eq. 5.1 .g) 
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A crystal consists of an infinite number of -unit cells and x' fmciJJ does not necessarily fall into 
the same unit cell as x fmctJ . However, rt is always possible to find a lattice translation 
7 hkt ~(h 9 k 9 l) such that x r fractJJ + t hk} and x fraG£j fall into the same unit cell (h, k and 1 are integer 
numbers). 

5.2 Calculation of Edft and its first derivatives 

All DFT calculations are carried out with, the VASP program [Kresse and Hafher 1993 & 
1994, Kresse and Furthmuller 1996 & 19S>6b, Kresse and Jouber 1999, VASP 2004] whicli 
was purchased from the University of Vienna. Since the VASP program is commercially 
available and well documented in the accompanying literature, we essential adopt a black box 
approach with respect to this part of the lattice energy calculation. However, we have to 
discuss the choice of certain parameters thart affect the numerical accuracy of the calculations. 
We also describe how the energies, pressures and forces computed by VASP can be converted 
to the equivalent energies and energy derivatives used within the hybrid method. 

5.2,1 VASP settings 

The numerical results of DFT calculations depend on various approximations, in particular on 
the choice of the exchange-correlation functional and the basis set. 

VASP offers a selection of different excTiange-correlation functionals. The local density 
approximation (LDA) and the generalized gradient approximation (GGA) with the revised 
Perdew-Burke-Ernzerhof functional were discarded right from the start because — even in the 
absence of additional empirical potentials - "they predict unrealistically strong binding energies 
for simple molecular crystals such as N2 in "which Van der Waals interactions play a dominant 
role. These binding energies result from exToneous interaction energies at short interatomic 
distances and do not reflect the correct long range behaviour of dispersive interactions. If the 
Perdew-Burke-Ernzerhof functional or the Perdew-Wang-9 1 functional is employed, the 
generalized gradient approximation seems to work well in conjunction with additional 
empirical potentials. Detailed validation woxk has so far only been carried out for the Perdew- 
Wang-91 functional and all results presented in this document were obtained using this 
functional. 
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VASP uses a plane wave basis set with ultra-soft pseudopotentials or in conjunction with the 
projector augmented wave (PAW) method. All results presented in document were obtained 
with the PAW method which is considered to be less time consuming than the use of ultra- 
soft pseudopotentials at comparable accuracy. The numerical accuracy of any calculation with 
VASP strongly depends on the size of the plane wave basis set which is determined by the 
plane wave energy cut-off E cut and the k-point spacing. Accuracy and CPU time requirements 
increase with the number of wavefunctions. If the Monkhorst-Pack scheme is used to 
determine the k-point grid, a single parameter 1 is sufficient to define the k-point spacing (see 
section 5.5.2 in [VASP 2004]). 1/1 roughly corresponds to the distances between neighbouring 
k-points measured in A" 1 . Different parameterizations of the PAW method are supplied with 
VASP, For example, soft, normal and hard PAW potentials are provided for first row 
elements which are the main constituents of organic compounds. Soft potentials offer only a 
limited accuracy but are compatible with a low number of plane wave basis functions. Hard 
potentials, on the contrary, offer the best possible ultimate accuracy but require a high number 
of plane wave basis functions. All results presented in this document were obtained with 
normal PAW potentials. 

To choose appropriate values for E cu t and 1 as well as an appropriate set of PAW potentials, a 
series of single point energy calculations with increasing values of E cut and 1 was performed 
for various crystal packings of acetylene, ethylene, ethane and glycine. Using normal PAW 
potentials, it was found that the relative lattice energies are converged to within 0.01 kcal/mol 
per molecule for E cu t = 520eV and 1=15. All results presented in this document were obtained 
using these values. If hard PAW potentials are used instead of normal PAW potentials, the 
energy cut-off needs to be increased from 520 eV to 910 to achieve similar convergence. The 
converged relative lattice energies obtained with normal and hard PAW potentials turned out 
to agree to within less than 0.01 kcal/mol In summary, it can be said that relative lattice 
energies can be obtained with a numerical accuracy of about 0.01 kcal/mol per molecule 
(tested only for small organic molecules) if VASP is used with normal PAW potentials and 
E C ut— 520eV and 1=15. It is important to add that this accuracy is only obtained if the k-point 
grid of each crystal structure has at least two k-points along each direction of the reciprocal 
lattice. For crystal structures with large unit cells in direct space, additional k-points need to 
be added to the Monkhorst-Pack grid obtained with 1=15. 
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5.2,2 Transformations 

For every energy calculation, VASP creates an OUTCAR filo that contains among other data 
the lattice energy, the forces on all atoms in the unit cell and the pressure tensor. We now 
discuss the transformations that are required to convert these results to the energy and the 
energy derivatives used by the hybrid method. 

The lattice energy in the OUTCAR file is given in eV, while the energy unit used with the 
hybrid method is kcal/moL The following conversion factor is employed: 

Cev->kcai/moi = 4.3393 14* 10" 2 kcal/mol/eV (Eq. 5.2.2.a) 

The same conversion factor is used to convert forces frorxi eV/A to kcal/moI/A and the 
components of the stress tensor from eV to kcal/mol. For all atoms in the unit cell, the net 
force Fcartjj (Force on symmetry copy j of atom i in the asymmetric unit) is specified in the 
OUTCAR file with respect to the external Cartesian coordinate system in which the lattice 
vectors are defined. The energy derivative BE mr /5v. with respect to the atomic positions in 
the factional coordinate system can be calculated from these forces. If the coordinate vector 
v. changes by Av ; , the Cartesian displacement of the j-th symmetry copy of the i-th atom is: 

&ca«,tj = LStjWAV; (Eq. 5.2.2.b) 

The matrices L , 5 iy and W f are defined in section 5.1. For small displacements, the energy 
change is given by the following expression. 

AE DFr = Av? *^ 3L = X^L t jj*F earOJ (Eq. 5.2.2.C) 

Inserting (Eq. 5.2.2.b) into (Eq. 5.2.2.c) and taking into account that the equation must hold 
for all Av,. that are sufficiently small, we obtain: 

^ = 2X' *S; J j*r* F M (Eq. 5.2.2.d) 
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The stress tensor z is specified in the VASP OUTCAR file. For small lattice clianges, the 
strain tensor s and the stress tensor r are related to the to the total energy change by the 
following equation: 

3 

^ = ZVw (Eq. 5.2.2.e) 

We now express the strain tensor s in terms of the transformation matrix Zand its 
derivatives with respect to the cell parameters c { . If the transformation matrix changes from I 
to L ' = L + Ac t * 3Z / Sc. , a point x cart in Cartesian coordinates is moved to: 

K art = L'L~ l = (Z 4- Ac,. = 1+ Ac,. = 7+ S (Evq. 5.2.2.f) 

Here J is the identity matrix. The last equality defines the strain tensor^. Inserting this 
definition into (Eq. 5.2.2.e) and taking into account that the energy change is also given by 
AE DFT = Ac f . * d£ D/<Y . /5cv , we obtain: 

"^f = § ( i^ X ^ (E^5.2.2.g) 
5.2.3 CPU time reduction 



In the case of centered crystal structures (e.g. space group C 2/c), the CPU time requirements 
can be significantly decreased if the DFT calculations are carried out for the (smaller) reduced 
cell rather than for the standard unit cell. Further transformations are required in tfhis case to 
transform the forces and pressures obtained for the reduced cell to the standard unit <sell. 

Another way to save CPU time is to use the wavefunction from a previous calculation as a 
starting point for a new energy calculation if the difference of the corresponding crystal 
structures is small. This approach is commonly used by DFT structure optimization 
algorithms- For lattice energy calculations with variable unit cell parameters, it has to be taken 
into account that the basis set obtained for a fixed energy cut-off depends on the unit ceil. As 
the unit cell changes, the plane wave energy levels change as well. It would be very 
inconvenient to change the basis set each time an energy level crosses the energy cut-off, 
because basis set changes result in small discontinuities of the lattice energy. Therefore, 
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lattice energy minimizations are carried out with a constant basis set and the wavefunction 
from a previous calculation can be used as a starting point for the next calculation. However, 
to calculate reliable lattice energies, it is important to redefine the basis set at the end of the 
lattice energy optimization and to start a second lattice energy optimization using the new 
basis set. This procedure needs to be repeated until the energy changes between successive 
lattice energy optimizations are small enough. 



5.3 Calculation of Evaw and its first derivatives 
5.3*1 Energy 

In this work, we use atomic pair potentials Ea,b to describe long range dispersive interactions. 
The total interaction energy E d [ S p per unit cell can be obtained by shimming over all relevant 
atom pairs: 

*-*-7T- + £££ Z (Bq. 5.3.1*) 

V cell A B J=\ h,kj £ 
0<\Ar\zR c 

A ? = Xcart,A - *cart,BJ ~ ^cart^Jc,! (Eq. 5 .3 . 1 .b) 

Xtan.A = HS Afi (W A v A + wj + s A0 ) (Eq. 5.3.1.C) 

Xcan.Bj = L(S BJ {W B v B + w B ) + s BJ ) (Eq. 5.3.1 .d) 

(ti\ 



X carf,hJcJ ~ ^ 



(Eq.5.3.Le) 



In (Eq. 5.3.1. a), the first two sums both run over all atoms in the asymmetric unit. The third 
sum covers all nie symmetry copies of atom B in the unit cell. Finally, the last sum runs over 
all lattice translations of the i-tli symmetry copy of atom B for which the distance |Ar| with 

respect to atom A is bigger than zero (-> no self interaction) and smaller than a cut-off radius 
Rc. The interaction energy Ea,b of each atom pair is multiplied with the number of symmetry 
copies m A of atom A in the asymmetric unit. The factor Yz avoids double counting. The term 
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KIV CitU corrects for the systematic underestimation of Edisp due to the use of a finite cut-off 
radius R c and will be further discussed below. (Eq. 5 3. Lb)- (Eq. 53.1.e) define the distance 
vector Ar , The matrices and vectors used in these equations are defined in section 5,1, 

The pair interaction energy Ea,b is the product of a damping function f^B, a spline function 
gA,B and a C 6 A B Ir\ B term that describes the asymptotic behaviour of long range dispersrve 
interactions: 



E A ^(r)^~f AiB {r) 



sir) 



(Eq, 5.3,l.f) 



Here we have used r=]Ar] , C6 ? a,b is a constant that is different for each pair of atom types. Tlie 
determination of the coefficients is discussed in section 5.5. 

For the damping function, we use a generalized version of the damping function employed fc>y 
Wu and Yang [Wu and Yang 2002]: 



1 — exp 



< r ^ 



(Eq. S3.1.S) 



In the paper of Wu and Yang, c is set to 3.54, n is set to 1 .0 and the damping radii r Ar B are ttie 
sum of Van der Waals radii taken from literature [Bondi 1964], In this work (see section 5.5), 
we adjust the damping radii r A? B to experimental data. We can therefore choose c=l .0 without 
loss of generality. In addition, we also adjust the form factor n to experimental data. Fig. 3 
illustrates the influence of the form factor n by showing fA,B(r)*r" 6 for three different values of 
the form factor with rA,s=3.0. The smaller the form factor, the harder the transition from ttie 
long range r" 6 part to the constant short range part. As illustrated by Fig. 3, n determines tine 
hardness of the damping function. For the sake of simplicity, we use the same form factor for 
all atoms pairs. The generalization to atom type dependant form factors is straight-forward. 



The spline function in (Eq. 53 AS) is required to avoid discontinuities. Without the spline 
function, the total dispersion energy Edisp would change discontinuously each time that tfcie 
distance of an atom pair crosses the cut-off distance R<> The spline function progressively 
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turns on the atom pair potentials between the outer radius R c and an inner radius R s . We use a 
spline function with continuous first and second derivatives: 



-6x 5 +15x 4 -10x 3 +l 
0 



r < R s 

R, <r<R <: ,x = r ~ R * (Eq. 5.3 .1 .h) 



To complete the instructions for the calculation of Edi sp9 we need to derive the correction 
term £7 . We first consider the interaction of an atom A in the asymmetric unit with the 

symmetry copies of an atom EL The average density of these symmetry copies is p B =mB/V cell. 
Assuming a continuous distribution of the symmetry copies of B ? one can calculate the 
missing interaction energy due to the use of a spline function (and a cut-off radius): 

Z R s r * ceil R s T 

(Eq, 5.3.1-i) 

For the second (approximate) equality we have exploited the fact that f*A,B is practically equal 
to one for large interatomic distances. The factor Vz reflects the fact that only half of the 
interaction energy is attributed to atom A to avoid double counting. To obtain the total 
correction energy per unit cell Ecor, we take into account that A has itia symmetry copies and 
let A and B run over all atoms in the asymmetric unit: 

E cor = <Eq- 5-3.1 j) 

A B V cell 

K = _ jll^^X 2 ^ (Eq. 5.3.1.k) 

R s r A B "cell 

Using the spline function g(r) from (Eq. 5.3. Lh) and standard integration rules, the integral in 
(Eq. 5.3. l.k) can be determined analytically. Appropriate values for the inner spline radius 
and the cut-off radius are R S =15A and R C =18A. 



5.3.2 Derivatives 

The calculation of the first derivatives of Edi sp from equation (Eq. 5.3.1. a) with respect to the 
cell parameters Ci and the atomic positions is fairly straight forward: 
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dE. 



K dV 



cell 



- + / ~ m A — — yAr) 1 - zr Ar r — - 

0<|Ar|</< E 



Sv, "^4^ m 2'"' dr 

0<|A?|s« e 



A? I dv 



A? 



(Eq. 5.3.2.a) 
(Eq. 5.3.2.b) 



i J 



In the second equation, dE^ ( dv- is a column vector and 3AF / dv } is a 3x3 matrix (see 
below). The derivative dE AB I dr is easily determined analytically using (Eq, 5,3. l.f) - (Eq. 
5.3.1.h) and standard derivation rules. The cell volume V ce n is related to the transformation 
matrix L via the following equation: 

V C ei! ~ det(X) = L xx L yy L Z2 4- L yx L zy L X2 4- L^L^L^ — L^L^L^ - L yz L zy L xx ~ L^L^L^ 

(Eq. 53,2.c) 

Taking into account that dL/dc^s known (see section 5.1) 5 dV celi /3c, in (Eq. 5.3.2.a) can be 
obtained from (Eq. 5.3 .2.c) using standard derivation rules. The derivatives of Ar with 
respect to c* and v. are given by the following equations: 



3Ar = dL 
dc t dC; 



* fractal ^ fractjij 



V 



(h 



* Jract y BJ = S BJ (W R V B + W s ) + ? a ^ 

In the last equation, <J a ,, is equal to one if a=b and 0 otherwise. 



(Eq. 5.3. 2.d) 

(Eq. 5.3 .2.e) 
(Eq. 5.3.2.f) 

(Eq. 5.3.2.g) 



5.4 'Natural coordinates' for the lattice energy minimization of molecular crystals 
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In this section we present a coordinate system (called the natural coordinate system) that is 
more appropriate for lattice energy minimization than the coordinate system used in the 
previous sections (called the fractional coordinate system). Using the new coordinate system, 
energy minimization can be carried out using standard minimization routines such as the 
conjugate gradient technique [Press et al 2002]. The conjugate gradient technique requires the 
knowledge of the energy and its first derivatives. In the previous two sections, we have 
already shown how the energy and its first derivatives can be calculated in the fractional 
coordinate system. In this section, we will explain how natural coordinates can be converted 
to fractional coordinates and how energy derivatives obtained in fractional coordinates can be 
converted to energy derivatives with respect to natural coordinates. 

5.4.1 Example — why the coordinate system is important 

The choice of the coordinate system is crucial for the efficiency of the energy minimization 
process. To illustrate this point, let us consider the energy minimization of an isolated water 
molecule (see Fig. 4 which shows internal coordinates and Cartesian coordinates for water.). 
The point group for water is C2 V - Taking symmetry into account, we can describe the 
geometry of the water molecule by two parameters, for instance the H-O bond distance r and 
the H-O -I I bond angle G (internal coordinates). Alternatively, we may use the Cartesian 
coordinates x and.»y of one of the hydrogen atoms to describe the molecular geometry, 
keeping the oxygen atom fixed at the origin. 

We define a potential energy surface in internal coordinates by the following expression: 

(Eq. 5.4.1. a) 

For illustration purposes we have deliberately chosen a value for the H-O stretch force 
constant that is 10 times higher than realistic values. The conversion to Cartesian coordinates 
is straight forward using: 

r = V* 2 + y 2 (Eq. 5.4. 1 .b) 

0 = 2 arcsinO / r) (Eq. 5.4. 1 x) 



-28- 



WO 2005/091178 



PCT/EP2005/050778 



Fig. 5 shows contour plots of the potential energy surface for different definitions of the 
dimensionless parameters qi and q 2 , wherein contour lines represent 10 kcal/mol intervals. 

In Fig. 5, the potential energy surface is shown as a function of two dimensionless parameters 
qi and q 2 for three different cases: 

1 ) Cartesian coordinates : qi=x/A ? q 2 =y/A 

2) Internal coordinates: qi=r/A, q 2 =0/rad 

3) Scaled internal coordinates: qj=4 r/A, q 2 =0.5 9/rad 

In Cartesian coordinates (case 1) the potential energy surface is a curved valley. This kind of 
potential energy landscape is unfavourable for energy minimization. Most optimization 
algorithms proceed by a series of line searches. They first fall into the valley and than follow 
the valley to its lowest point. Since the valley is curved, this procedure may require a lot of 
steps. In internal coordinates (case 2), the potential energy surface is usually much more 
haxmonic than in Cartesian coordinates. In our particular case, it is perfectly harmonic by 
definition. In a harmonic potential well, the conjugate gradient algorithm finds the minimum 
in about n steps, where n is the number of variables. Using internal coordinates, the 
^convergence is usually much faster than using Cartesian coordinate?. In case 2, the potential 
energy well is harmonic, but the energy increases much faster in one direction than in the 
other. In extreme cases, this type of situation can lead to convergence problems. Using 
properly scaled internal coordinates (case 3), one obtains a more isotropic potential that is 
more favourable for energy minimization. 

5.4.2 Natural coordinates for the lattice energy minimization of molecular crystals 

In molecular crystals, atomic displacements of similar size can result in very different 
potential energy changes. On the one hand, structural changes the strongly affect bond lengths 
and bond angles are usually accompanied by important potential energy changes. On the other 
hand, there are concerted atomic motions that keep the bond lengths and bond angles constant 
while modifying intramolecular torsion angles and/or the molecular arrangement in the unit 
cell. Such concerted displacements are called weak modes as they result in large atomic 
displacements at small energetic cost. In Cartesian coordinates, weak modes often correspond 
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to complex, curvilinear coordinate changes. In analogy to the water example discussed above, 
it is important for efficient lattice energy minimization to choose a coordinate system in 
which weak modes correspond to more or less straight lines. 

In molecular crystals, intramolecular interactions are usually significantly stronger than 
intermolecular interactions. Therefore, weak modes and strong modes can be decoupled to a 
great extend by the use of separate coordinates for the description of whole molecule 
translations and rotations on the one hand and for the characterization of the molecular 
geometry on the other hand (see Fig. 6). Fig. 6 shows degrees of freedom describing atomic 
displacements in the unit cell; 1) Whole molecule translations, 2) whole molecule rotations, 3) 
intramolecular coordinate changes, the two molecules in the unit cell being symmetry related. 
To describe whole molecule translations, we use the position of the molecular centre (average 
over all atomic positions) in fractional coordinates. The orientation of the molecule is defined 
in terms of three successive rotations around mutually perpendicular axes. For the description 
of the intramolecular degrees of freedom we use delocalised internal coordinates which have 
shown to be an excellent choice for the structure optimization of isolated molecules [Baker 
1997; Baker etal 1996]. 

So far, we have dealt with atomic displacements in a fixed unit cell. We now turn our 
attention to unit cell changes. Lattice deformations belong to the weakest modes in molecular 
crystals and it is important to choose a coordinate system in which the degrees of freedom 
relating to the unit cell and those relating to the atomic positions are more or less decoupled. 
The degree of coupling is strongly linked to the atomic displacements that are induced by unit 
cell changes. To illustrate this point, it is helpful to consider the case of fractional atomic 
coordinates as an example for a fairly inefficient choice of coordinates. If fractional 
coordinates are used to define the atomic positions in the unit cell, these fractional coordinates 
are usually held constant when the unit cell changes. As a consequence, unit cell changes, and 
in particular volume changes, are accompanied by strong changes of the covalent bond 
lengths resulting in strong changes of the net forces acting on the atoms. Cell parameters and 
atomic coordinates are thus strongly coupled. 

In this work, we describe unit cell changes in terms of an anisotropic expansion/compression 
of a starting cell along three mutually perpendicular axes (see next subsection and Fig. 7). Fig. 
7 shows the molecular centres following the deformation of the lattice, while the molecular 
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geometry and the molecular orientation remain unchanged. The molecular centres follow the 
expansion/compression (i.e. the fractional coordinates of the molecular centres remain 
constant) but the molecular geometries and orientations remain the same (i.e. the Cartesian 
coordinates of the atoms in a molecule with respect to the molecular centre remain constant). 
Our choice for the connection between lattice changes and atomic coordinate changes 
guarantees that lattice deformations only weakly affect the forces that are acting on the atoms 
in the unit celL Let us suppose that the atomic coordinates correspond to the lattice energy 
minimum for a given set of unit cell parameters. In such a case, the forces on all atoms, and as 
a consequence the net forces and net torques on all molecules, are zero. If the unit cell 
changes, forces and torques change only little. The resulting net force on each molecule is 
small, because the expansion/compression along a given direction changes the intermolecular 
distances on both sides of the molecule in the same way. The resulting net torque on each 
molecule is small, because the expansion/compression does not induce a rotation of the 
molecule with respect to its neighbours. Finally, the change of the intermolecular distances 
results in small additional forces on all atoms in the unit cell, but these forces are much 
smaller than the forces that would have resulted from a change of the intramolecular distances 
rather than the intermolecular distances. 

In the following four subsections, we present a more mathematical description of the natural 
coordinate system and its relationship with the fractional coordinate system. 

5.4.3 Lattice changes 

We describe lattice changes in terms of a transformation A/ from initial lattice vectors a 0 , h Q 
and c 0 to new lattice vectors a , b and c : 

a = Ma Q , b - Mb 0 , c — Mc 0 (Eq. 5A3.a) 

L = ML 0 (Eq. 5.4.3.b) 

The vectors a Q , b 0 and c Q are oriented with respect to the external Cartesian coordinate 
system as described in section 5.1. The lattice vectors a Q , b 0 and c 0 are the columns of the 
transformation matrix L Q from fractional atomic coordinates to Cartesian atomic coordinates. 
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The matrix M can always be decomposed into the product of an orthonormal matrix and a 
symmetric matrix (see (Eq. 5.5.1.d> (Eq. 5.5. l.f)). The symmetric matrix corresponds to an 
anisotropic expansion/compression along three mutually perpendicular axes. The orthonormal 
matrix corresponds to a rotation of the crystal lattice. Since a rotation does not change the 
lattice parameters, we can drop the rotation without loss of generality and impose that the 
transformation matrix M is S3'mmetric. 

The new unit cell parameters a, b 3 c, a, p, and y can be obtained from the new lattice vectors 
a , b and c using the following equations: 



a-4a T a 



180° 
a — arccos 

7C 



1 h c x 



* 180° 
p = arccos 

71 



r a T c 



ac 



(Eq. 5.4.3.C) 
(Eq. 5.4.3.d) 
(Eq. 5.4.3.e) 

(Eq. 5.4.3 .f) 
(Eq. 5.4.3.E) 



180° 
y = arccos 

TV 



a b x 



ab 



J 



(Eq. 5.4.3 .h) 



The unit cell parameters can be converted to the independent, variable cell parameters Ci by 
means of Tab. 5.1. a. 

In all crystal systems apart from the triclinic one, the symmetric transformation matrix M 
must obey additional symmetry constraints. In the orthorhombic case, for instance, all of 
diagonal matrix elements must be zero because the lattice angles are fixed to 90°. We 
therefore decompose the transformation matrix M into a sum over independent, symmetry 
allowed transformations: 



(Eq. 5.4.3 .i) 
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In the above equation, I is the identity matrix, k ? is a scale factors and the matrices M f are 
defined in Tab. 5 A3, a for the various crystal systems. The parameters /I, are the coordinates 
that are used in the natural coordinate system to describe lattice changes. There are as many 
parameters Z f in the natural coordinate system as there are parameters Cj in the fractional 
coordinate system. 

Tab. 5.4.3.a: Components of the transformation matrix M used to describe lattice changes. 
In the trigonal case, e x 9 e 2 and e 3 are mutually perpendicular unit vectors with e x being 



parallel to a Q + b Q 4- c {) . 



Crystal system 


nc 




M 2 


M s 


M 4 


M 5 


^6 


triclinic 


6 




f \ 0 0^ 
0 0 0 

,° 0 °, 






r O 0 
O 1 0 
[o 0 0, 






'0 0 0^ 
0 0 0 
,0 0 1, 






'0 1 0^ 
10 0 
k 0 0 0, 






f o o r 

0 0 0 






'0 0 0^ 
0 0 1 

^o i oj 


monoclinic 


4 




f \ 0 0^ 
0 0 0 
,0 0 0, 






r Q 0 0^ 
O 1 0 

sP 0 °J 






f 0 0 0^ 
0 0 0 
,0 0 1, 






r 0 0 P 
0 0 0 
,1 0 Oj 








orthorhombic 


3 




f \ o 6 s 
0 0 0 
,0 0 0, 






r Q 0 0> 
O 1 0 
K o 0 0^ 






'0 0 0^ 
0 0 0 
,0 0 ij 










tetragonal 


2 




f \ 0 0^ 
0 1 0 

,0 0 0, 






r o o 6 s 

O 0 0 

yP 0 1, 












hexagonal 


2 




r l 0 o'' 

0 1 0 

,0 0 0, 






r O 0 0^ 
O 0 0 

P o ij 












trigonal 


2 




e^el + 










cubic 


1 




'1 0 0> 
0 1 0 
0 0 ij 















In subsection 5.4.1, we have seen that it is important to scale all coordinates in such a way the 
size of the potential energy well is roughly the same in all directions. In this work, the scale 
factor k x is set to 0.02. 



5-4.4 Whole molecule translations and rotations 
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Crystal structures are usually defined in terms of lattice parameters and fractional atomic 
coordinates. Molecules are not specified at this basic level. The number of independent 
molecules as well as their composition and symmetry need to be derived from the fractional 
coordinates and the crystal symmetry. In this subsection, we first define a molecule in terms 
of its degrees of freedom and than discuss how the molecular composition, the molecular 
coordinate system and the molecular symmetry operations can be derived from the crystal 
symmetry and the fractional atomic coordinates. 

For each molecule i, there is a set of r\ t symmetry independent atoms that constitute the 
molecular asymmetric unit. The position of each atom is specified in terms of a vector 3 i with 
respect to a molecular Cartesian coordinate system. Each atom i in the asymmetric unit has m 
symmetry copies at positions & u with: 

& u =r o 4 (Eq. 5.4.4.a) 

Here T UJ is a 3x3 matrix and T iJQ vs the identity matrix, i.e. i9 i0 = «9 i . In the molecular 
coordinate system, the geometrical centre of the molecule coincides with the origin: 

E£v,=0 (Eq.5A4.b) 

m y=i 

Some of the atoms in the asymmetric unit may lie on a symmetry element such as a rotation 
axis or a mirror plane. To consider only symmetry allowed atomic displacements, we 
decompose the Cartesian positions as follows: 

3 ^.o + fXA/ (Eq.5A4.c) 

Here *9 i0 is a reference positions. The cr l orthonormal vectors u tJ correspond to the 
symmetry allowed displacement directions and the dimensionless parameters £ij determine 
the atomic position. All vectors in (Eq. 5 A4.c) are measured in A. 
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Whole molecule rotations are defined in terms of np successive rotations around some 

or all of the axes of the molecular coordinate system. In addition, a rotation R 0 needs to be 
applied to transform the atomic coordinates from the molecular coordinate system to the 
external Cartesian coordinate system. The overall rotation R is given by: 

_ _ ?p_ 

R = XoYlMPt) 5AAA) 

w 

Due to symmetry constraints, there may be less than three successive rotations. For a 
molecule with a mirror plane, for instance, only rotations around an axis perpendicular to the 
mirror plane are possible. There may be 0, 1 or 3 allowed rotations and we always choose the 
molecular coordinate system in such a way that the rotation axes coincide with axes of the 
molecular coordinate system. Depending on the rotation axis, the matrices ) can take 

different forms specified in Tab. 5AA.su The parameters p, are the coordinates used in the 
natural coordinate system to describe whole molecule rotations. All rotation matrices contain 
a scale factor K p . In this work we use ^=0.03. 



Tab. 5 A4.a: Matrices describing rotations around axes of the molecular coordinate system. 
k is a scale factor and p i is a rotational coordinate in the natural coordinate system. 



Axis 


* f (A) = 


X 




'10 0 ^ 
0 cos K p p l simc p p i 
0 - sin K p p t cos K p p tJ 




Y 




' COSKpP, 0 SVCLK p p^ 

0 1 0 

^-sin^p,. 0 cos K p p, j 




Z 




f COS K p p t SV&KpPt 0^ 
— SYO.K p p l COS «•„/?, 0 

, 0 0 1, 





In order to define the position of the molecule in the unit cell, we specify the position of the 
centre of geometry in fractional coordinates. If the centre of geometry overlaps with a 
symmetry element of the crystal structure, there are less than 3 independent move directions 
for the molecule. In analogy to (Eq. 5 A4.c), we write: 
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T = ?o + Zw6 (Eq.5.4.4.e) 



Here f 0 is a reference position, co specifies the number of symmetry allowed move 
directions and/c r is a scale factor set to 0.1. The parameters r, specify the molecular position 
in the natural coordinate system. The independent move directions £t 3X0 defined in such a 
way that the vectors Z~ 0 £ are orthonormaL Using (Eq. 5.4.4.d) and (Eq. 5.4.4.e), the atomic 
positions in molecular Cartesian coordinates can be converted to fractional atomic 
coordinates: 

x^-T + r- 1 ^ (Eq. 5.4.4.f) 

The atoms in the molecular asymmetric units are mapped onto the atoms in the asymmetric 
unit of the crystal on a one to one basis. Each atom i in the asymmetric unit of the crystal is 
related to an atom at(i) in a molecule mol(i), and each atom i in a molecule j is related to an 
atom cry(ij) in the asymmetric unit of the crystal: 

cry(at(i) 9 mol(i)) « i (Eq. 5.4 Ag) 

at(cry(ij)) = i (Eq, 5.4.4.h) 

mol(cry(ij)) = j (Eq. 5 A4.i) 

As there can be more than one independent molecule, most scalars, -vectors and matrices in 
(Eq^ 5.4.4.a) - (Eq. SAA.f) should carry an additional index to indicate the molecule they refer 
to. However, for the sake of simplicity we omit the second index whenever this does not lead 
to confusion. 

In general, it is necessary to apply a symmetry operation to an atom i in the asymmetric unit 
of the crystal in order to obtain the corresponding atom in the molecular asymmetric unit: 

= StfPft + w t ) + 1 (Eq. 5.4.4 j) 
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Combining (Eq. 5A4.f) and (Eq. 5.4.4.J), we obtain the transformation ftom molecular 
Cartesian coordinates to the coordinate vectors v. that describe the atomic positions in the 
factional coordinate system, 

v, ^Wr^r l (f mom +L*R m , <o **v**(0 CEq. 5.4.4.k) 

The above equation allows us to describe the atomic displacements in the unit cell in terms of 
whole molecule translations (via T wo/(/> ), whole molecule rotations (via R mol{i) ) and changes 

of the molecular structure (via ^ /(/))W70 / (/) ). The coordinate transformations involves vectors 

(such as S> , T 0 ? (j i9 etc), matrices (such as S i9 R Q , etc) and mappings (such as mol(i)-, at(i), etc) 

that need to be derived from the crystal symmetry and the initial positions of the atoms in the 
asymmetric unit. The second half of this subsection deals with the determination of the above 
mentioned objects. 

As a starting point, we suppose that we know the fractional coordinates x i of all atoms in the 

asymmetric unit, their multiplicity mi and the corresponding symmetry operations (see (Eq. 
5.1.g)). Using this information, we can generate the atomic positions of all atoms in a central 
unit cell and the surrounding unit cells. Our first task is to identify the molecules that the 
atoms in the asymmetry unit belong to. For our purposes, we define a molecule as a set of 
atoms where every two atoms are connected via a series of covalent bonds. Two atoms i and j 
are covalently bonded if their distance is smaller than a certain multiple of the sum of their 
covalent radii. We use a multiple of 1.3 with the covalent bond radii presented in Tab. 5.4.4.a. 



Tab. 5.4 Ab: Covalent bond radii for some elements. 



Element 


Covalent radius [Al 


H 


0.299 


B 


0.830 


C 


0.767 


N 


0.702 


O 


0.659 


F 


0.619 


CI 


1.023 


s 


1.052 
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To attribute all atoms in the asymmetric unit to a molecule, we start of with the first atom and 
we find all atoms in the central unit cell or in adjacent cells that are connected to the atom at 
the position S^x, 4-? u via a series of covalent bonds. We have now identified the first 

molecule which contains symmetry copies of a certain number of atoms in the asymmetric 
unit and which therefore determines the positions of these atoms. If there are any atoms in the 
asymmetric unit that do not belong to the first molecule, we take one of these atoms and we 
determine the corresponding second molecule. The procedure is repeated as long as there are 
atoms in the asymmetric unit that have not yet been attributed to a molecule. At the end of this 
procedure, the lookup table mol(i) is fully defined for all atoms i in the asymmetric unit. 

Each molecule needs to be further processed. For the sake of simplicity, we again omit the 
index related to the number of the molecule in our discussion. Let us suppose that the 
molecule consists of n atoms at positions x'j. It is straight forward to calculate the centre of 

geometry T 0 that serves as a reference position for molecular translations: 



Knowing the molecular centre, we can determine the symmetry allowed translations and 
rotations. For a given space group, there are m gcn general symmetry operations that can be 
described in terms of a matrix S i and a translation s f (see (Eq. 5.1 .g)). We first determine the 

subset of the m salf symmetry operations ( SJ , 5/ ) that project the molecular centre T 0 onto itself 
to within a translation by a lattice vector: 



(Eq. 5.4.4.1) 




(Eq. S.4.4.nx) 



Here hj, kj and lj must be integer numbers. A molecular displacement <f is symmetry allowed 
if and only if: 



g = Sg foralliwithl <i< m s 



(Eq. 5.4.4.n) 
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It is thus possible to determine a complete set of symmetry allowed displacements £ by 
solving the system of linear equations defined by (Eq. 5.4Axi). We define the displacement 
vectors £ such that the vectors Z 0 £ are orthonormal. 

We now turn our attention to the determinations of the symzmetry allowed whole molecule 
rotations. For the discussion we choose a Cartesian coordinate system with the origin at 
f 0 and axes parallel to the axes of the external Cartesian coordinate system. Any atom at a 
general position Fin this coordinate system has m self symmetry copies at the following 
positions: 

with Zj » L 0 S!L- } (Eq. 5 A4.o) 

One of the symmetry operations Z,must be the identity operation. If we carry out a small 
rotation by an angle A<p around an axis through the origin <3efined by a unit vector n , the 
change of the vector r is given in the linear approximation by - 

Ar = A<pnxr (Eq. 5.4Ap) 

The coordinate change of the symmetry copies can be obtained either by applying the 
symmetry operations to<(Eq. 5.4 Ap) or by applying the rotation to the result of (Eq. 5.4 Ao). 
Since both approaches must yield the same result, one obtains a series of conditions for the 
vector n : 

Z£n x r) = n x (Z,r) for all r and 1 < i < m self (Eq. 5.4 Aq) 

It can be shown that above conditions are fulfilled if and only if: 



n T Pij = 0 for all p tJ from Tab. 5.4Ac and 1 <i< r*3 self (Eq. 5 A4.r) 

Tab. 5.4 Ac: Vectors p fJ required in (Eq. 5.4Ar) ? Z ijk is an element of the matrix Z i . 
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Ptj 
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Pu 
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Pu 
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0 ^ 
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( Z \ 
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-Z \ 








— Z i.ac ~ Z lja 
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— Z 








^ Z i,yx + Z i^y j 
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z — z 




5 




( 7 +7 
0 


\ 
/ 


8 




(Z -Z \ 

Z i>yx 
— Z 


3 




f -z ~) 

Z — 7 




6 


(z -Z ^ 




9 




r -z - z \ 
Z- + z. 

l,XZ 1 l,ZX 

< 0 J 



The vectors p LJ span a subspace of the three-dimensional coordinate space. The dimension of 
this subspace can be 0, 2 or 3. If the dimension is 3, there are no symmetry allowed rotations. 
If the dimension is 2, there is one symmetry allowed rotation axis n f . Finally, if the dimension 
is 0, all rotations are allowed. Knowing the vectors p tJ9 the allowed rotation directions are 
easily calculated. 

We now turn to the determination of the transformation matrix R 0 from the molecular 
Cartesian coordinate system to the coordinate system with the origin at T 0 and axes parallel to 
the axes of the external coordinate system (shifted external coordinate system). We always 
chose the matrix R Q such that the axes of the molecular coordinate system are parallel to the 
principal axes of the 'inertia tensor 5 calculated with all masses set to 1 for the initial 
molecular geometry. The molecular x-axis and z-axis correspond to the smallest and the 
largest 'moment of inertia% respectively. The principal axes and the principal moments reflect 
the shape at the molecule and it can be expected that there is some kind of correlation between 
these axes on the one hand and the directions of weakly/strongly hindered rotations on the 
other hand. It is therefore advisable to use the principal axes as rotation axes for whole 
molecule rotations. 

We define the 'tensor of inertia 9 with respect to the shifted external coordinate system as 
follows: 

r, , _ 2 _ _ _ 

® = Z|£o(*;-T 0 )| /-i 0 (^-T 0 )(x;-T 0 ) r 2J (Eq.5.4As) 

M 
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R Q is the orthonormal matrix that diagonalizes © , i.e. the eigenvectors of 0 are the columns 
of R,: 

&R Q =R 0 A (Eq.5.4At) 

Here A is a diagonal matrix with < A }y < . Since we describe whole molecule 

rotations in terms of rotations around the axes of the molecular coordinate system, we always 
have to make sure that the allowed rotations axes coincide with the molecular axes. This is 
automatically the case if all or no whole molecule rotations are allowed, but the case of a 
single allowed rotation axis requires further discussion. In principle, the allowed rotation axis 
always corresponds to a principal axis due to symmetry considerations. However, it may 
happen accidentally that the 'tensor of inertia' has two or three degenerate (identical) 
eigenvalues. In this case, standard diagonalization routines do not necessarily yield 
eigenvectors with one eigenvector parallel to the allowed rotation axis and the eigenvectors 
need to be redefined. If no eigenvector is parallel to the allowed rotation axis, we distinguish 
two cases: 

Allowed rotation axis n perpendicular to one of the eigenvectors e . The new 
eigenvectors are n, e and a vector perpendicular n and e . 

Allowed rotation axis n perpendicular to none of the eigenvectors. The new 
eigenvectors are n and two additional vectors perpendicular to n and to each 
other. 

Once the transformation matrix R 0 is determined, it is straight forward to choose the rotation 

matrices R. . If there is a single allowed rotation, R f must correspond to the allowed rotation 

axis in the molecular coordinate system. If all three rotations are allowed, we choose R l = R x , 

R 2 = R y and R^=R z .Tn certain cases, for instance when dealing with linear rigid molecules, 

it can be appropriate to disable the rotation around the axis with the smallest 'moment of 
inertia' 

We still need to specify the atoms in the molecular asymmetry unit, thLeir mapping onto the 
atoms in the asymmetric unit of the crystal and their symmetry copies within the molecule. To 
obtain this information, we examine all atoms of the molecule one by oae. Let us suppose that 
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we are considering atom j of the molecule at the position Xj which is related to the m-th 
symmetry copy of atom i in the asymmetric unit of the crystal via: 

= S g „ (Wft + w f ) + s itm + t iJnJ (E<q. 5.4 Au) 

We need to distinguish two cases: 

Case 1: So far, we have not come across an atom within the molecule that is related to the 
atom i in the asymmetric unit of the crystal. We add the atom j to the atoms in the molecular 
asymmetric unit. The atom gets a new index j 9 which is equal to the current numbesr of atoms 
in the asymmetric unit. We set cry(j\mol)=i and at(i)=j\ For the symmetry elements that 
relate atom j ? in the molecular asymmetric unit to atom i in the asymmetric unit of the crystal 
we obtain: 

The position of atom j' in the molecular Cartesian coordinate system is: 

3 AQ = R 0 -%(x' f - T 0 ) (Bq. 5.4.4.w) 

Symmetry allowed atomic move directions for atom j ? can be obtained from the* following 
equation: 



u f fJk =Ro l L Q S i W i e k , \<k<a f , e k = 



rn 

0 

PJ 



(E«q. 5.4.4.x) 



Here only such vectors e k are allowed that are compatible with the definition of tine relevant 
(non-zero) components of the vectors v^in section 5.1. The vectors can be obtained from 
the vectors v r r k by orthonormalization. 

Whenever we add a new atom j 9 to the molecular asymmetric unit, we set the zn umber of 
symmetry copies //of atom j 9 to 1. There is always at least one symmetry copy r^, which is 
equal to the identity matrix. 
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Case 2: We have already come across an atom J that is related to the m'-th syinmetry copy 
of atom i in the asymmetric unit of the crystal by (Eq. 5.4.4. u). The atom j has become atom 
J' in the molecular asymmetric unit. The atom j currently under consideration is symmetry 
copy of atom J' . We thus increase the number of symmetry copies ju of j ' by one. The new 
symmetry operation is given by: 

T 7> = R 0 ~%S h J^% (Eq. 5 A4.y) 

The procedure outline above is repeated until all atoms of the molecule have been assigned to 
an new atom in the molecular asymmetric unit (case 1) or to the symmetry copy off an existing 
atom in the molecular asymmetric unit (case 2), 

5.4.5 Delocalized internal coordinates 

In the previous section, we have defined the intramolecular degrees of freedom in terms of 
atomic displacement parameters C fJ where the index i refers to the atom number in the 

molecular asymmetric unit and the index j refers to the cr 1 symmetry allo\ve<L Cartesian 
displacements of atom i. In the following, we will define the molecular conformation in terms 
of a symmetry adapted Cartesian coordinate vector £ that is composed of the; individual 
parameters C/j : 

£ ^^y,..., ) ^ 5.4.5.a) 

As we have seen in section 5.4.1, Cartesian coordinates are not the most efficient way to 
describe intramolecular degrees of freedom. In this subsection, we introduce internal 
delocalized coordinates 0 = (& 1 ,...,&„ 0 ) that are more appropriate for energy minimization. In 
our discussion, we adapt the approach outlined in Baker et al. 1996 to the case oP a molecule 
on a general (no internal symmetry elements) or a special position (internal symmetry 
elements) in a crystalline environment. 

In many computer programs, the so called Z-matrix approach is used in order to define the 
molecular geometry. The Z-matrix is nothing else but a non-redundant set of intramolecular 
coordinates such as bond lengths, bond angles and torsion angles which completely define the 
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molecular geometry. For the energy minimization of complex molecules (covalently bonded 
rings in particular), the Z-matrix approach is often quite inefficient, since the arbitrary 
selection of internal coordinates used to describe the molecular geometry does not correspond 
to the best possible choice. Baker et al. describe the molecular geometry in terms of a 
non-redundant set of delocalized internal coordinates, where each delocalized coordinate 
is a linear combination of all localized internal coordinates (bond length, bond angles, 
etc.). The main merit of their approach is to provide a straight forward procedure for the 
construction of 'the best possible set' of internal delocalized coordinates for any molecule. 

Following Baker et al, we consider a set of nq internal coordinates q = {q A >>*.* 9 q ttq ) T - In 

general, we chose all intramolecular bond lengths, bond angles and torsion angles. 
Symmetry related internal coordinates are considered only once. Bond angles close to 
180° require special attention. The determination of internal coordinates and their first 
derivatives as a function of the Cartesian atomic coordinates will be dealt with at the end 
of this section. 

Small changes Aq of the internal coordinates are related to small Cartesian 
displacements AC by means of the B matrix: 

Aq = BAC (Eq. 5.4.5.b) 

(Eq. 5.4.5.c) 



B = 



In general, there are fare more internal coordinates q t than allowed Cartesian move directions 
£ . As we want to describe the molecular geometry in terms of internal coordinates, it is 
important to remove this redundancy. An elegant way to remove redundancies is to 
diagonalize the nqxnq matrix G = B Q B 0 J \ where B 0 is the B matrix obtained for the initial 
molecular geometry q 0 . Non-redundant directions are readily identified as the n9 mutually 
perpendicular eigenvectors ^ of G with eigenvalues greater than zero. The normalization of 
the eigenvector \jf ( is discussed later. By means of the eigenvectors yr i9 one can define nd 
non redundant coordinates 6 t : 
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0, ft) (Eq.SAS.d) 

As the coordinates 6>. are linear combinations of many primitive internal coordinates, they are 
called delocalized internal coordinates. These are the coordinates used in the natural 
coordinate system to describe the molecular geometry. 

Small Cartesian coordinate changes A^f can be easily converted to changes of the delocalized 
atomic coordinates AO — (A0 1 A0 n9 ) : 

A0 = £A£ (Eq. 5 A5.e) 

S =U T *B (Eq.5A5.f) 
U = (^ 0 ,...,^) (Eq, 5A5.g) 



Here the vectors y/ i form the columns of the matrix U . It is important to note that there are, in 
general, more symmetry allowed Cartesian displacements than delocalized internal 
coordinates. This is due to the fact that whole molecule translations and rotations change Hie 

Cartesian coordinates, but not the internal coordinates. Consequently, B is not a square matrix 
and cannot be directly inverted. However, it is possible to construct an 'inverse B matrix' in 
the following way: 

I" 1 = B T (SS r y l (Eq, 5A5.h) 

A£ = B-*A0 (Eq.5A5i) 

The 'inverse B matrix' defined in (Eq. 5 A5.h) yields Cartesian displacements without any 
translational or rotational component. 

The transformation from delocalized internal coordinates to Cartesian coordinates is not linear 
and (Eq. 5 A5.i) only holds for small coordinate changes. For large coordinate changes, we 
use an iterative procedure to transform delocalised internal coordinates to Cartesian 
coordinates. Let us suppose that the Cartesian coordinates £ correspond to the delocalized 
internal coordinates 6 and that we would like to know the Cartesian coordinates £' that 
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correspond to new delocalized internal coordinates 9\ We use the Cartesian coordinates C as 
a starting point and set<f(o)= £ - Here zero indicates the start of the iteration. Given the 
Cartesian coordinates^ (&), one can calculate the corresponding delocalized internal 

coordinates 0{k) and the corresponding inverse B-matrix B~ l {k) . It is then possible to 
exploit the linear relationship for small coordinate changes to obtain a better estimate for the 
Cartesian coordinates that match 0* : 

C(k + \) = g(k)+£-\k)[§ f -0(k)] (Eq. 5.4.5 j) 

The iterative procedure must be repeated until the Cartesian coordinate changes become 
negligible small. The iteration normally converges very rapidly and convergence to within 
10" 10 A is typically obtained after 3-4 cycles. The speed of convergence obviously depends on 
the quality of the starting point for the iterative procedure. For energy minimizations, we 
always use the structure with the currently lowest energy as a starting point. 

The iterative procedure outlined above yields Cartesian coordinates C' that match the 
delocalized internal coordinates 6\ but the Cartesian coordinates C are not uniquely 
defined. If whole molecule translations and rotations are not forbidden by symmetry 
constraints^ ithe repeated use of the iterative procedure results in ill-defined whole molecule 
translations and rotations in the molecular reference system. A small part of these ill-defined 
whole molecule displacements stems from the accumulation of rounding errors. The main 
effect, however, is related to the fact that the overall rotation of the molecule depends on the 
pathway followed in 6 space. If one changes the geometry from 6 Xo0\ from 6' to 0" and 
from 6" back to<9 5 an overall rotation of the molecule is induced that depends on the 
conformations 6 i and 0" . 

When delocalized internal coordinates are used for the structure optimization of isolated 
molecules, the occurrence of ill-defined whole molecule displacements does not need to be 
taken into account, as such displacements do not result in potential energy changes. It is 
probably for this reason that the problem was not mentioned in the work of Baker et aL In the 
case of crystal structure optimization however, such ill-defined whole molecule displacements 
in the molecular reference system can not be tolerated, as they affect the interatomic distances 
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and thus the potential energy. We solve the problem by introducing a additional whole 
molecule translations and rotations which depend on the Cartesian displacements^ and insure 
that the molecule always adopts a well-defined position and orientation. 

To start with, we replace (Eq. 5 A4.c) by the following expression: 

4' = 4,o+ IX A/ (Eq. 5.4.5 JO 

We then transform the ill-defined Cartesian coordinates &! to well-defined Cartesian 
coordinates^; 

3 = K*M'-Krr) (Eq. 5.4.5.1) 

= W (Eq. 5.4.5.m) 



n 



*-1 >l 



^ = II R i,corA<Pd (Eq. 5A5.il) 



7, A 



S * ES r /Ao =0 for all rotation directions (Eq. 5A5.o) 

In (Eq. 5.4.5.m), n is the total number of atoms in the molecule, i.e. the sum over all 
multiplicities ju t - By subtraction 3' corr from all^', we make sure that the centre of the molecule 
always coincides with the origin of the molecular Cartesian coordinate system. The matrix 
R corr rotates the molecule so that it adopts a well defined orientation. R corr is the product of np 
matrices (see (Eq. 5.4.5.n)) that are identical to the rotation matrices in (Eq. 5.4.4.d), apart 
from the fact that the rotation angles p i have been replaced by rotation angles <p t . (Eq. 

5.4.5.o) uniquely defines the rotation matrix^,.,., as there are as many equations as rotation 
angles <p i . The vectors n k are unit vectors parallel to the symmetry allowed rotation axes in 
the molecular reference frame. 

The determination of the rotation angles (p = (<p^.-.,<p np ) requires some additional 
explanations, as (Eq. 5.4.5.n), (Eq. 5.4.5.o) and Tab. 5.4.4.a define a complex set of non- 
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linear equations. In general, we already know the Cartesian displacements £ 311(1 the rotation 
angles #5 that correspond to the delocalised internal coordinates & , and we would like to 
determine the vectors <f ' and ip r that correspond to new delocalized internal coordinates §' . 
The change or the rotation correction from q> to <p' is fairly small in most cases. We therefore 
use an iterative procedure consisting of a series of linear approximations to determine^'. To 
start with, we set ^(0) = Ip , At each iteration, we first calculate R corr Q) and dR corr I d<p i (/) for 
the current rotation correction <p(/). We then calculate the new rotation correction according 
to the equations below: 

+ 1) = ^(/) - M~ l fh (Eq. 5.4.5.p) 

m^{jn^ 9 ..^m np f (Eq. 5.4.5.q) 



fjh_j^ > 

I' O' 



(Eq. 5.4.5.r) 



(Eq. 5.4.5.S) 



^ -afZir^x^S^^-O (Eq.5.4.5.t) 

The iterative procedure must be repeated until the Cartesian coordinate changes induced by 
the rotation J? cor ,. become negligible small (see (Eq. 5,4.5.1)). The iteration normally 
converges very rapidly and convergence to within 10" 10 A is typically obtained after 3-4 
cycles. 

In the first part of this section, we have provided an overview over the essential mathematical 
concepts required to define delocalised internal coordinates and to convert delocalised internal 
coordinates to Cartesian coordinates. We now add some important details that have been held 
back so far to simplify the above discussion. 

We first turn our attention to the normalization of the eigenvectors y/ t of the G matrix. 
Delocalised internal coordinates are defined with respect to a certain starting geometry given 
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by the atomic positions 0 . For this starting geometry £ 0 , <p 0 and 0 Q are all zero. A unit 
change A9 i = 1 with respect to the starting geometry leads to Cartesian displacements 
. that are given in the linear approximation by the following equation: 

A£ = — B'fr (Eq. 5 A5.u) 

Here A s is the eigenvalue for the eigenvector iff. . The length of \p i is thus inversely related to 
the Cartesian displacements induced by a unit change A#,=l. We normalize with 
respect to the norm of A^f. : 



1 ^r. 



(Eq. 



5A5.v) 

We have seen in section 5.4.1 that coordinates used for energy minimization should be scaled 
in such a way that a unit displacement away from the energy minimum results in roughly the 
same energy change for all coordinates. Performing calculations for several molecular 
crystals* we have found that a value of K e =- 0.033 is roughly compatible with the values 
k-^0.03, 7^=0.02 and ar r =0.1 used to scale the other coordinate types. A more thorough 
parameterization of the scale factors is planned. 

The next item we need to discuss is the diagonalization of the G matrix. In most cases, there 
are significantly more internal coordinate than Cartesian displacement directions. We follow 
[Andzelm et al 2001] and diagonalize the smaller matrix F = B 0 J B 0 instead of the bigger 

matrix G = B 0 B Q T : 

FS^SA (Eq.5A5.w) 

Here S is the matrix of eigenvectors with non-zero eigenvalues and A is the corresponding 
diagonal matrix of eigenvalues. The matrix U that contains the eigenvectors of G is easily 
obtained in a second step: 

U = B Q SX~ m (Eq. 5.4.5.x) 
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A central task when dealing with delocalized internal coordinates is the calculation of the B 
matrix. In most cases, we use a complete list of bond lengths, bond angles and torsion angles. 
Symmetry related internal coordinates are considered only once. The following equations 
relate the bond length 1 and. the bond angle <p (measured in rad) to the Cartesian coordinates 
of the corresponding atoms. The atom numbers are defined in Fig. 8. Fig. 8 illustrates the 
definition of a bond length, a bond angle and a torsion angle. 



/ - -^Yi^i-xi) (Eq- 5 A5.y) 

(Eq. 5.4.5.Z) 



q> = arccos 



V 



1*1 X 2| X 3 J 



The mathematical definition of a torsion angle 6 is a little bit more complicated. We first 
define two vectors \\ and v 2 that are then used to define the torsion angle 9 . Positive torsion 
angles correspond to clockwise rotations when looking down the axis from atom 2 to atom 3. 

v = (x, - x 2 ) - C( * 3 " f 2)7 ( *» = * 3>) - 1 } (Eq. 5.4.5.a2) 

\x 2 -x 3 \ \x 2 -x B \ 

v 2 = (x 4 - x 3 ) - (( * 4 ~ y T ( f 2 = * 3)) f} ~ 1 3 (Eq. 5.4.5.b2) 



0 = 



27m + arccos , 1 p 2 . : ((x 3 -x 2 )x(x 3 — x 2 )) r (x 4 - 5e 3 ) < 0 

WI^L (Eq. 5.4.5x2) 

2a(n + 1) - arccos Z 1 / 2 , : ((*i - 3c 2 ) x (x 3 - x 2 )) r (x A — x 3 ) > 0 



To avoid discontinuities at eclipsed atomic configurations, we attribute a counter n to each 
torsion angle that counts the number of full rotations and is increased/decreased by one each 
time the eclipsed atomic configuration is crossed. 

To obtain the internal coordinates as a function of the Cartesian displacements £ , the vectors 
3c,. in (Eq. 5.4.5.y)- (Eq- 5.4.5x2) need to be replaced by the appropriate vectors 3 tJ which 
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are related to the Cartesian displacements £ via (Eq. 5.4.4.a) and (Eq. 5A4.c). The 
derivatives of the internal coordinates with respect to the components of £ can be obtained by 
the application of standard derivation rules. 

A particular situation arises when a bond angle approaches tc. In this case, the first 
derivatives of the bond angle with, respect to the Cartesian coordinates diverge, and some 
torsion angles may become ill defined. If the central atom has more than two neighbours - a 
situation frequently encountered when dealing with metallo-organic complexes or inorganic 
compounds — it is often possible to simply drop to ill-behaved internal coordinates, as the 
remaining internal coordinates are sufficient to characterize the molecular geometry 
completely. If the central atom has two neighbours, we also drop all ill-behaved internal 
coordinates. In addition, we define two reference directions e x and e 2 that are mutually 
perpendicular and roughly perpendicular to the axis that runs through the two neighbouring 
atoms. The two reference directions allow us to define two new internal coordinates <p x and 
cp 2 that describe the deviation of the central atom and its two neighbours from a straight line. 
Both internal coordinate are the sum of two 'bond angles': 



ft = arccos %% f 2 . } + arccos * ' ^ 3 ^ (Eq. 5.4.5.d2) 

1 — " \eA\x, - x ' 



We also allow for jumps across the central atom to define additional torsion 
angles. In case B of Fig. 9, for instance, we would define a torsion angle that involves the 
atoms 5-1-3-4. Fig. 9 shows three atoms on a straight line. 



In the case of linear molecules, the two reference directions e } and e 2 are determined once 
and for all when the delocalized internal coordinates are initialized and held fixed with respect 
to the molecular reference frame afterwards. In principle, we could do the same for non-linear 
molecules. However, it may happen that one of the two space fixed reference directions 
progressively becomes parallel to the axis running through the atoms 1 and 3 upon structure 
optimization. In such a case, the deviation from a straight line again becomes ill-defined. It is 
therefore more appropriate to chose reference directions related to the atomic coordinates. 
With respect to case B of Fig. 9, for instances, we could have chosen the following atom 
related reference directions: 
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(Eq. 5 A5.e2) 



&2 — &\ ^ C^l 



(Eq, 5.4.5 .£2) 



The first derivatives of ^and (p 2 with respect to the atomic coordinates can again be 
calculated using standard derivation rules. The generalization to more than 3 atoms on a 
straight line is straight forward. 

It is important to realize that the coordinate transformation from delocalized internal 
coordinates to Cartesian coordinates, initialized for a certain starting geometry, may not be 
valid for all conformations of the molecule. Because of the curvilinear nature of the 
delocalized internal coordinates, there may not be a one to one correspondence with the 
Cartesian coordinate system and it may happen that the iterative procedure used for the 
coordinate transformation does not converge if one moves too far away from the starting 
point. In addition, certain bond angles, initially not close to^r, may approach n and vice 
versa, requiring a redefinition of the set of internal coordinates used to define the delocalized 
internal coordinates. During an energy minimization, it is important to monitor if the 
transformation from delocalized internal coordinates to Cartesian coordinates is still well 
defined. If the transformation is no longer valid, the minimization needs to be stopped and the 
coordinate transformation has to be reinitialized using the currently best structure as the new 
initial geometry. Then, the minimization procedure can be restarted. 

In this section, we have introduced two significant advancements compared to the approach 
described by Baker et al. Firstly we have reformulated the approach of Baker et al in such a 
way that the molecular point group symmetry can be explicitly taken into account, resulting in 
an important reduction of the total number of degrees of freedom. Secondly, we have 
addressed and solved the problem of ill-defined whole molecule translations and rotations, 
making it possible to use delocalized internal coordinates not only for isolated molecules but 
also for molecules in a crystalline environment, 

5*4.6 From relative coordinates to natural coordinates and back - a summary 
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In subsection 5.4.3 to subsection 5.3.5, we have defined the natural coordinate system and we 
have described the various steps required to transform natural coordinates to fractional 
coordinates. As the great amount of mathematical detail presented in the previous subsections 
may hamper the understanding of the overall procedure, we summarize the most important 
steps in this subsection. 

The use of natural coordinates for lattice energy minimization implies three distinct tasks: the 
initialization (one may also say the definition) of the natural coordinate system, the 
transformation from natural coordinates to fractional coordinates and the transformation from 
first energy derivatives in fractional coordinates to first energy derivatives in natural 
coordinates. In this subsection, we take a second look at the first two tasks. The 
transformation of energy derivatives is described in the subsequent subsection. 

In most databases, scientific publications or program outputs, crystal structures are defined in 
terms of lattice parameters and fractional atomic coordinates. The initialization of the natural 
coordinate system consists of the determination of a certain number of scalars, vectors and 
matrices that relate the natural coordinate system to the fractional coordinate system. A list of 
the required scalars, vectors and matrices is presented in Tab. 5.4.6.a. 

The transformation from the natural coordinate system to the fractional coordinate system 
involves the following steps. First new lattice vectors a, b and c and a new transformation 
matrix Tare determined using (Eq. 5.4.3 .a), (Eq. 5.4.3.b) and (Eq. 5.4.3. i). The lattice 
parameters Ci used in the fractional coordinate system can be obtained from the lattice vectors 
5, h and c by means of (Eq. 5A3.c> (Eq. 5 A3.h) and Tab. 5.1. a. 

In a second step, all independent molecules must be dealt with one by one. First, the 
delocalized internal coordinates 0i , ... G ne must be converted to Cartesian displacements 
fusing the iterative procedure described in subsection 5.4.5. The Cartesian displacements 
can be converted to uncorrected Cartesian coordinates using (Eq. 5.4.5.k). Then, corrected 
Cartesian coordinates in the molecular reference frame must be determined using (Eq. 5.4.5.1) 
- (Eq. 5.4.5.o) and the iterative procedure for the calculation of R COfT ((Eq. 5A5.p)- (Eq. 
5.4.5.t)). Finally, after having determined the whole molecule translation from (Eq. 5.4.4.e) 
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and the whole molecule rotation from (Eq. 5 AAA), the atomic positions v^used in the 
fractional coordinate system can obtained from (Eq. 5.4Ak). 

Tab. 5.4.6.a: Scalars, vectors and matrices that are determined when the natural coordinate 
system is initialized, Objects referring to molecular properties need to be defined for each 



molecule. 



Symbol 


Description 


T 


Initial transformation matrix from fractional atomic coordinates to the external 
Cartesian coordinate system 




Matrices defining symmetry allowed lattice deformations 


cry(ij) 


Lookup table that relates atom i in the asymmetric unit of molecule j to the 
corresponomg aioni m mc dbyiiiiiiciiiL/ mm. ox inw ^ij^icu 


& at (i) 


t r./>i'nn +dKi^o rp»l£itf»c thf* atom i in the awm metric unit of a crystal to the 

l_X)OKUp XaOieS inaiXeidlcS IHC cllUIH I III uxc a&jrxixii«--u.xv^ uu.xj.i- kjj. «. vijoiui w 

corresponding atom in the asymmetric unit of a molecule 


s & f. 


Symmetry operations relating the atoms in the asymmetric unit of the crystal to 
the corresponding atoms in a molecule 




Molecular point group symmetry operations generating symmetry copy j of 
atom i 


% 


Molecular centre in fractional coordinates 


I 


Symmetry allowed molecular move directions in fractional coordinates 




Transformation matrix from molecular Cartesian coordinate system to the 
shifted external Cartesian coordinate system 




Symmetry allowed rotation matrices 




Initial atomic coordinates in the molecular Cartesian coordinate system 




Symmetry allowed atomic move directions in Cartesian coordinates 




Complete list of internal coordinates (bonds lengths, bond angles, etc.) 




Initial values of the internal coordinates 




Vectors defining a non-redundant set of delocalized internal coordinates 


&corr 


Translation that insures that the molecular centre coincides with the origin of the 
molecular coordinate system. Initially zero. 




Rotation angles that insure that the molecule adopts a well defined orientation in 
the molecular coordinate system. Initially zero. 



5.4.7 Energy derivatives with respect to natural coordinates 

In section 5.2 and section 5.3 we have seen how energies and energy derivatives can be 
calculated using the fractional coordinate system. In this subsection, we describe how energy 
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derivatives the fractional coordinate system can be converted to energy derivatives in the 
natural coordinate system. 

To simplify the general part of the discussion, we define a vector $ that comprises all ncj) 
coordinates of the fractional coordinate system (cell parameters c* and relevant components of 
the atomic positions v f ) and a vector fj =(Xq 9 ... 5 Xnx, t\ 9 u tmi,!? Puu pnpi,i 3 0i,i 9 • 
©nouijt^ —) that comprises all nrj natural coordinates. Using the chain rule for derivatives, 
one obtains the first energy derivatives with respect to natural coordinates; 

^*iF5 (Eq.5A6.a) 
dij d<l> 

Here dE/d$ is known and Q is defined below: 



(Eq, 5.4.6.b) 



In principle, the derivatives dfy /dr/j can be obtained by applying standard derivation rules to 

the equations in subsection 5.4,3, 5.4,4 and 5.4.5. We therefore only discuss some aspects of 
the derivative calculation. 

The lattice parameters Cj of the fractional coordinate system only depend on the parameters \\ 
of the natural coordinates system. The relationship between the two sets of parameters can be 
established via Tab. 5 J .a, (Eq. 5.4.3.c)- (Eq. 5.4.3.h) 9 (Eq. 5.4.3.a) and (Eq. 5.4,3.i), 

The atomic coordinates v i depend on all natural coordinates. The central equation for the 
transformation from natural coordinates to atomic coordinates v,is (Eq. 5.4.4.k), IT 1 is the 
only term in (Eq, 5A4.k) that depends on the natural coordinates \ 0 , X n x. fiZT l /9/l,is 
related to / via: 

— = -I' 1 — I" 1 (Eq, 5.4.6.C) 

bX t dA,, 
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L depends on X 0 , via (Eq. SA3.b) and (Eq. 5.4.3 .i). f mfl/(;) is the only term in (Eq. 

5.4Ak) to depend on the natural coordinates ti, mo i(i> . x m , m oi(i) via (Eq. 5.4 Ae) and R MoKn is 
the only term to depend on the natural coordinates pi, moi(i> — Pn P , mo\{\) via (Eq. 5.4- Ad). 
Finally, ^ <0sMj9 /( 0 is the only term that depends on the delocalized internal coordinates 6 1 9 m oi(i), 

©nei, moi(i)- Knowing the derivatives of L , T mw/(/) 9 

Koto)™* & am<noi(i) with respect to the 
natural coordinates, the derivatives of v i with respect to the natural coordinates are readily 
obtained by the application of standard derivation rules to (Eq. 5.4 Ak). 

The calculation of the derivatives of i9 ; (Here we have dropped the index mol(i) and we have 
renamed at(i) to i in order to simplify the succeeding expressions) with respect to 9j is a little 
bit more complicated than the calculation of the other derivatives. The corrected Cartesian 

coordinates 3 f are related to the uncorrected Cartesian coordinates via (Eq. 5.4.5.1), We 
thus obtain: 

!f = %H,9/-^ , rr ) + R, (Eq. 5.4.6.d) 

Taking into account (Eq. 5A5.h), (Eq. 5A5.i) and (Eq. 5A5.k| 5,9// 90, is straight forward 

to calculate. Expanding the left side of (Eq. 5 A5.o) into a Taylor series up to first order, one 
obtains: 



(Eq. 5.4.6.e) 



BR 

XT' UJS -corr 


d<p t 




d0 J 







K d0/ '80,; 



(Eq. 5.4.6.f) 



m = {m l ,...,m„ p ) T (Eq. 5.4-.6.g) 

'jLJt— a5 /N 



J=l m Corr $9 j ) 



(Eq. 5.4-.6.h) 
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The matrix Mis defined in (Eq. 5A5.s) and (Eq. 5.4.5.t). With 5^7 30,- and 
dR cmr 189 j known, (Eq. 5 A .6 A) can be used to calculate 83^ / 36 j . 

In this section, we have dealt with the calculation of first lattice energy derivatives. If 
required, second derivatives can be obtained in a similar fashion. 

5.5 Refinement of empirical parameters for tfie hybrid method 
5.5.1 Deviation function 

To optimize the empirical parameters used with the hybrid method, we need to define a 
function F(pu ...,p n ) that depends on the empirical parameters p* and that reaches the global 
minimum when a good fit with the experimental data is obtained. Once such a function is 
defined, suitable empirical parameters can be obtained, at least in principle, by minimization 
of this function. In the following, we will call F(pi, . . .,p n ) the deviation function and construct 
it un such a way that it is zero if all experimental data are precisely matched and bigger than 
zero otherwise. 

■A V 

Since we want to adjust the empirical parameters to low temperature crystal structures, we 
need to quantify the deviation between a set of experimental crystal structures and a 
corresponding set of calculated crystal structures. The empirical potentials mainly affect the 
unit ceil parameters and the molecular arrangement in the unit cell, while they have little 
effect on the molecular conformations. It is therefore appropriate to focus on the comparison 
of unit cell parameters. 

Given two unit cells with unit cell vectors 2, , b x , c x and a 2 , b 2 , c 2 we calculate the 
transformation matrix T\j>2 from one set of unit cell vectors to the other: 

3 2 =^>2 «i > b 2 = T x _ >2 b x , c 2 = T^ >2 c x (Eq. 5.5.1.a) 

The transformation matrix can be determined from the two matrices Li and L 2 the columns of 
which are identical to the unit cell vectors: 
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h =(fl i9 b t9 c t ) 



(Eq. 5.5.1. b) 
(Eq. 5.5.1. c) 



According to the linear algebra theorem on singular value decomposition, any square matrix 
can be written as the product of an orthonormal matrix U x zl. diagonal matrix D and another 
orthonormal matrix U 2 : 



T^ >2 =U l DU 2 =T wt T def 



T def ^U 2 l DU 2 



(Eq. 5.5. l.d) 
(Eq. 5.5.1.e) 
(Eq.5.5.Lf) 



7^ can be understood as a compression/expansion of "the original lattice along three 
perpendicular directions. The directions are given by the rows of U 2 9 while the size of the 
compression/expansion is given by the diagonal elements oJF D . The matrix T roi is a rotation 
matrix. The transformation T } _ >2 thus consists of a deformation followed by a rotation. Only 
the deformation changes the geometry of the unit cell axid is therefore relevant for the 
construction of the deformation function F. To characterize the overall deformation by a 
single value 3 wc use the following two expressions: 



3 } 



















) 



(Eq. 5.5. l.g) 



f-1 Z 



(Eq. 5.5.1. h) 



Here di, d 2 and d 3 are the diagonal elements of D . Both expressions are symmetric with 
respect to di and 1/di to insure that the transformations T w> ^ and T 2 _ >} are attributed the same 
value. (Eq. 5.5. l.h) is more appropriate for parameter optimization, since it allows for 
continuous first and second derivatives. (Eq. 5,5. Lg) results in discontinuous derivatives 
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whenever it reaches zero, but the physical meaning of this expression is easier to understand. 
For small deformations, (Eq. 5.5.1.g) reduces to: 

A^SK-ll CEq. 5.5.1.1) 

In the absence of any lattice deformation, all di equal one and A, is zero. If the lattice is 
compressed/expanded by 1% along a single direction, A r equals 1%. If the lattice is 
compressed/expanded by 1% along all three perpendicular directions, A ^ equals 3%. A f thus 
is the sum of the absolute values of the individual compressions/expansions. 

The deviation function for a set of N crystal structures is given by the following expression: 
F = Ti-%- (Eq. 5.5.1.J) 

Here A /7J is the deviation between the j-th experimental crystal structure and the 
corresponding result of a lattice energy minimization, w* is a weighting factor. To compute 
F for a given set of empirical parameters, one has to minimize the lattice energy and evaluate 
(Eq. 5.5.1. h) for each member in a set of N crystal structures. 

It is easy to generalize (Eq. 5.5. l.j) so that it takes into account additional experimental 
information. If a crystal structure A is known to be more stable than another crystal structure 
B of the same molecule, one may add a term that contains a deviation of the following type: 

^ab=\^ A Z E ^ 2 Ea ~ E J (Bq. 5.5.1 JO 



0 E A <E B 



Here E A and E B are the lattice energies per molecule of the two crystal structures. For the 
parameter refinement presented in subsection 5.5.4, we have only used structural information. 



5.5.2 Optimization procedure 
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In principle, one could minimize the deviation function F directly with a standard algorithm 
such as the Powell minimization algorithm. In practice, however, this straight forward 
approach turns out to inconvenient. At each function evaluation, all crystal structures need to 
be re-optimized and the calculation times for the lattice energy minimization of a single 
crystal structure range from several hours to several days on a state-of-the-art PC. Even if the 
calculations are carried out on a PC cluster, the CPU times involved get rapidly out of hand. 

To cut down on calculation times, we use an iterative procedure to exploit the fa_ct that the 
energies and forces from the time consuming DFT part of the calculation do not change when 
the empirical parameters are modified (see Fig. 10). Fig. 10 shows a flow diagram for 
parameter refinement. Starting from an initial set of reasonable empirical parameters, all 
crystal structures are optimized. For each crystal structure, a set of points is chosen around the 
lattice energy minimum. DFT energies and gradients are calculated at all these points, which 
have to be chosen in such a way that the second derivatives (Hessian) matrix can be 
constructed from the gradients. 

In a second step, the empirical parameters are optimized without performing any additional 
DFT calculations. The Powell algorithm [Press et al 2002] used for the parameter 
optimization does not require any derivative information. Each time the deviation function F 
needs to be evaluate*! for a new set of empirical parameters, we determine the new calculated 
crystal structures in the harmonic approximation using the stored energy and gradient 
information. For each crystal structure, we calculate the VdW energies and gradients at the 
chosen points and combine them with the stored DFT energies and gradients to get the total 
energies and gradients. The later are used to determine the second derivatives matrix. Using 
the second derivatives matrix in conjunction with the energy and the gradient at "the central 
reference position, the new lattice energy minimum is determined approximately. 

The Powell algorithm returns an improved set of empirical parameters that is used as a 
starting point for the next iteration which begins with the lattice energy minimization of each 
crystal structure. The iterative procedure is stopped when sufficient convergence of the 
empirical parameters is achieved. In the following paragraphs, we discuss some aspects of the 
optimization procedure in more detail. 
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At each step of the iterative procedure, the initial crystal structure optimizations are carried 
out with fully flexible molecules. From then on, all molecules are considered to be rigid to 
reduce the number of structural degrees of freedom. 

We now describe how the lattice energy minimum of a crystal structure <3an be determined in 
the harmonic approximation. Let q min =(q m in,o 5 -.. 9 q m in,m) he the vector ofm coordinates (unit 
cell deformations, whole molecule translations and whole molecule rotations) that 
corresponds to the lattice energy minimum obtained for empirical parameters p == (pi, ... 5 p n ). 
Around the minimum q min y we chose m points q +i and m points q__ j : 

q^q^ -Aqi*e, (Eq.5.5.2.b) 

Here e t is the unit vector for which all elements except for the i-th element are zero. For each 
i, the scalar value Aqi is chosen such that: 

Epot, P (^)- E pot ,p (q min ) = Ep 0t ,p (#_,,)- E po t, P (q^ )~ AEtarget (Eq. 5.5 .2.6) 

In the above equation, the index p refers to the empirical parameters p = (pi, . ~ ? p n ). AEtarget is 
a^onstant used in the calculation. A value of AEtarget = 0-25 kcal/mol has turned out to be 
appropriate in practice. The use of a target energy AE ta r g ct insures that the points q + i and 
are positioned at the right distance from the minimum: within the potential energy well of 
the local minimum, but far away enough to avoid problems related to the noise level of the 
energy calculation. For later use, we calculate and store the gradients VEdft( 

VEdft( q- 4 X VE DFT ( q m - m ) and the energy E DFr ( ft* ). 

Now let us suppose that we would like to know the minimum of the lattice energy for a new 
set p'= (p'i, ...,p' n ) of empirical parameters. We can obtain the following energy and 
gradients without any further DFT calculations: 

Epot, P ' (? ra in) = E DF T(^ mill ) + EvdW, P ' (q min ) (Eq. 5.5.2.d) 

VEpot,p' ( q min ) - VE DFT ( q^ ) + VEvdW, P ' ( ) (Eq. 5.5.2.e) 
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VEpot,p> («+,,) = VE D ft + VEvdw,p' for 0<i<m (Eq. 5.5.2 J) 

VEpotp* (£-,,) = VE DFr + VE V dw,p> (g^) for 0<i<m (Eq. 5.5.2.g) 

(Eq. 5.5.2.f) and (Eq. 5.5.2.g) can be used to construct the second derivatives matrix H . We 
first construct the matrix M the columns of which are given by: 

i-th column of M - (VE po t,p> VE pot , p < (£^))/2/Aqi (Eq. 5.5.2.h) 

For infinitely small Aqi and in the absence of rounding errors, the matrix M is equal to the 
second derivatives matrix. Since we use finite values of Aqi, this relationship holds only 
approximately. Unlike the second derivatives matrix H 9 Mis not exactly symmetric in most 
cases. We therefore use the following expression to obtain a symmetric approximate second 
derivatives matrix: 

F = 0.5*(Ji7+M T ) (Eq. 5.5.2.i) 

Here M T is the transpose of M . In the harmonic approximation, the new lattice energy 
minimum q' mln is given by the following expression: 

= ?nun - VE pot , p * ( q^ ) (Eq. 5.5.2.j) 

H~ x is the inverse matrix of H . The lattice energy at the new minimum is: 

E P ot, P > (qU* )= E pot , p , )+ VEp 0 ^ (^ ) T Aq +0.5* Aq T H Aq 

with Aq = q^ - ^ (Eq. 5.5.2.k) 

5*5.3 Starting values for the form factor n 9 the damping radii r and the C<? coefficients 

The optimization procedure described in the previous section leads to the local minimum for 
which the basin of attraction reaches out to the set of starting parameters. As el consequence, 
one can only get close to the global minimum if appropriate starting parameters are chosen. 
The use of more sophisticated global optimization methods that do not depend on the choice 
of starting parameters (The method described in 5.5.2 is a local optimization merthod.) such as 
simulated annealing or parallel tempering is currently beyond reach because of the long 
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calculation times involved. Our derivation of appropriate starting parameters closely follows 
the work of Wu and Yang [Wu and Yang 2002]. 

For the form factor n, we set n=l as this choice corresponds to the damping function used by 
Wu and Yang. Starting values for the homonuclear damping radii ta,a are obtained by 
multiplying Bondi's VdW radii [Bondi 1964] by a factor of two. All atoms of the same 
element are attributed the same starting value, regardless of their chemical environment. 
Starting values for heteronuclear damping radii ta,b are generated using an arithmetic 
combination rule: 

r *j> = \ ( r ,u + t b,b) ( e <1- 5.5.3-a) 

Values for the C 6 coefficients can be obtained by fitting atomic C 6 coefficients to molecular 
C6 coefficients from dipole oscillator strengths distributions [Wu and Yang 2002]. According 
to Wu and Yang, the atomic C$ coefficients significantly depend on the hybridisation state of 
the atom. We use the hybridization dependent Ce coefficients proposed by these authors. Ce 
coefficients for unlike atom pairs can be obtained from the Cg coefficients of like atom pairs 
using the following combination rule: 

r _ 2{Cl )AA C 6 ^ n N effJi N qfftB ) 1/3 

Here N e ff is the effective electron number. Like Wu and Yang, we use the effective electron 
numbers proposed by Halgren [Halgren 1992]. 



The various parameters mentioned above are summarized in Tab. 5.5.3. a for the different 
hybridization depend atom types of hydrogen, carbon, oxygen and nitrogen. For nitrogen, we 
use a single atom type because the available experimental information does not allow for the 
refinement of hybridization dependent Ce coefficients in this case. For carbon and oxygen, we 
derive the hybridization state from the number of covalently bonded neighbours. 



It is important to stress that the approach presented in this document is not limited to 
hydrogen, carbon, oxygen and nitrogen. VdW radii and effective electron numbers are readily 
available for many elements and molecular Ce coefficients have been published for molecules 
containing elements such as F, CI, Br and S. 
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Tab. 5. 53. a: Starting parameters for hydrogen, carbon, nitrogen and oxygen. 



Atom type A 


r X 6i t / n 

C 6 ,a,a [A D kcal/mol] 


"KT 

N e ir,A 


ta,a [A] 


H 


38.93 


A OA 


2.4U 


C(sp 3 ) 


303.8 


2.49 


3.40 


C(sp 2 ) 


376.6 


2.49 


3.40 


C(sp) 


409.8 


2.49 


3.40 


N 


265.8 


2.82 


3.10 


0(sp 3 ) 


159.9 


3.15 


3.04 


0(sp 2 ) 


178.3 


3.15 


3.04 



The technique for the derivation of molecular Ce coefficients from oscillator strength 
distributions is a mixture of experimental and theoretical components. Even though not 
completely based on experimental data alone, it is likely that the molecular C$ coefficients 
used by Wu and Yang are fairly accurate. As their atomic C6 coefficients reproduce the 
molecular data extremely well, it can be assumed that the atomic Ce coefficients accurately 
model the Van der Waals interaction at long interatomic distances. The C$ coefficients in Tab. 
5.5.3.a are thus much more than just reasonable starting values and further refinement of these 
values is not necessarily required. In other words, the long range part of the empirical pair 
potentials can be determined independently of the exact nature of the DFX part of the hybrid 
method. The damping radii and the form factor, on the other hand, determine the behaviour of 
the empirical pair potentials at short to intermediate interatomic distances where the DFT part 
and the VdW part of the hybrid method are both important. They must be adjusted such that 
the hybrid method as a whole accurately reproduced experimental results. 

5,5,4 Preliminary parameter refinement for C, N, O and H 

Ultimately the approach described in this document is expected to work for a wide variety of 
elements. At the point in time when this document is written, however, only a preliminary 
parameter refinement for C, N, O and H has been carried out and a more advanced parameter 
refinement including the additional elements Cl 5 F and S is in progress. In this section we 
describe the results of the preliminary parameter refinement for C, N, O and H. 

Tab. 5.5 Aa: Set of crystal structures used for parameter refinement. 
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, Co raposifipp^ 


WW 


fttHiil 


lm 


benzene 


BENZEN06 


C6H6 


15 


463 


2.2 


butane 


Refson et al 1986 


C4H10 


5 


223 


1.4 


propane 


Boeseetal 1999 


C3H8 


30 


365 


2.3 


durene 


Neumann etal 1999 


C10H14 


1.5 


403 


4.0 


p-xylene 


Prageretal 1991 


C8H10 


4 


311 


1.3 


acetylene 


McMullanetal 1992 


C2H2 


15 


208 


5.0 


acetylene cubic 


McMullanetal 1992 


C2H2 


143-*0 


216 


0.9 


acetic acid 


ACETAC06 


C2H402 


4 


289 


2.2 


formic acid 


FORMAC02 


CH202 


5 


193 


1.6 


furan 


BUNJAV02 


C4H80 


5 


401 


2.5 i 


methanol 






1 « 

1 3 




7 ^ 


terephtalic acid 


TEPHTH03 


C8H604 


2 


169 


3.9 


hexamethylenetetr amine 


HXMTAM10 


C 6 H 12 N 4 


15 


332 


1.3 


nitrogen alpha 


ivliiis et ai i v oo 


N2 






1.9 


nitrogen gamma 


iviiiis et ai i you 


JS12 


on 
ZK) 


on 
oU 


0 1 
Z, 1 


DJ-llOIIlCulailC 




l^Xij IN ^2 


A 
* 


^ / _> 




N 5 N' -diformohydrazide 


FOMHAZ16 


C2H4N2O2 


15 


178 


2.4 


N-hydroxy- 
methaneimidamide 


FORAMO01 


CH 4 N 2 0 


16 


277 


9.8 


glyoxime 


GLOXIM11 


C 2 H 4 N 2 02 


9 


178.6 


6.2 


urea 


UREAXX12 


CH4N2O 


12 


145 


3.4 



To refine the empirical parameters used with the hybrid method, a set of 20 low temperature 
crystal structures was selected which is listed in Tab. 5.5.4.a. The table presents the 
compound name, the CSD reference code or a literature reference for the crystal structure, the 
composition of the compound, the experimental temperature and the volume of the unit cell. 
The cell parameters of the cubic phase of acetylene were extrapolated down to OK using cell 
parameters from a series of measurements at 143 K and above. All crystal structures were 
measured at normal pressure apart from the crystal structure of y-nitrogen determined at a 
pressure of 407 MPa. The crystal structures listed in Tab. 5.5 Aa correspond to a total number 
of 62 variable unit cell parameters. 

In a first refinement, only five parameters were adjusted, namely the form factor n and one 
damping radius per element. The C<$ coefficients were kept constant at the values indicated in 
Tab. 5.5.3.a. The combination rules (Eq. 5.5,3.a) and (Eq. 5.5.3.b) were used to obtain 
damping radii and Ce coefficients for pairs of unlike atoms. Following the procedure outline 
in section 5.5.2, the five parameters were adjusted to reproduce the unit cells of the 
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experimental crystal structures, i.e. to minimize (Eq. 5.5. l.j) with m>. =0.001. After two 
iterations (see Fig. 10), the parameter refinement had essentially converged and a value 
F=650 was obtained for the deviation function. The individual deviations A ; (see (Eq. 
5.5. l.g)) of the calculate crystal structures from the experimental crystal structures after 
parameter refinement are shown in the last column of Tab. 5.5 Aa. On average, calculated and 
experimental crystal structures deviate by A 7 =3.3%, corresponding to average cell lengths 
errors of roughly 1%. In summary it can be said that the adjustment of only 5 parameters 
yields a very satisfying agreement between the calculated and the experimental crystal 
structures. The values of the adjusted parameters before and after parameter refinement are 
presented in Tab. 5.5 Ab. 



Tab. 5.5 Ab: Values before and after parameter refinement. 



Parameter 


before 


after 


n 


1.00 


0.2621 


*H,H [A] 


2.40 


3.3029 


rc.c [A] 


3.40 


3.8140 


rN.N[A] 


3.10 


3.3255 


ro.o[A] 


3.04 


2.8301 



Using the results in Tab. 5.5. 4.b as a starting point, several attempts were made to further 
improve the agreement between the optimized and the experimental crystal structures by 
allowing for more degrees of freedom in the parameter refinement (hybridization dependent 
damping radii, individual damping radii for unlike atom pairs instead the using a combination 
rule, C$ coefficients) and by trying different combination rules instead of (Eq. 5.5.3.a). None 
of these attempts led to significant improvements and many refinements resulted in 
unreasonable parameters for the damping radii or the C 6 coefficients. In general, it was found 
that there is a very strong correlation between the damping radii on the one hand and the form 
factor and the C6 coefficients on the other hand. Almost the same overall agreement between 
the calculated and the experimental crystal structures can be obtained for different form 
factors and/or different sets of C 6 coefficients if the damping radii are adjusted independently 
for each of these choices. To avoid unphysical values, it is therefore advisable to us the Ce 
coefficients from Tab. 5. 5. 3. a without further refinement. 
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It needs to be stressed that the results presented in this section are only preliminary. There 
may be other damping functions, combination rules or sets of adjustable parameters that yield 
a significantly better agreement between the calculated and the experimental crystal 
structures. A more thorough investigation of the various options is currently under way. It is 
also important to note that parameters in Tab. 5.5.4.b are only guaranteed to yield accurate 
results in conjunctions with the DFT method described in 5.2 that was used for the parameter 
refinement. If the hybrid approach is to be used with another DFT method, a 
reparameterization of these parameters may be required. 

5.6 Energy ranking and in silico polymorph screening 

In the previous section, we have shown how the empirical parameters of the hybrid method 
can be derived from structural information. In principle, there is no guarantee that the 
obtained parameter set is also appropriate for the accurate energy ranking of crystal structures. 
In this section, we present a validation study which clearly indicates that the hybrid method, 
with the parameters obtained in subsection 5.5.4, offers the accuracy required for in silico 
polymorph screening. 

The validation study was carried out for a test set of six small molecules, namely ethane, 
ethylene, acetylene, methanol,, acetic acid and urea (see Fig. 1 1). The test set covers a variety 
of chemical interactions. Ethane, ethylene and. acetylene contain a single, double and triple 
carbon-carbon bond, respectively. Their crystal structures are predominantly held together by 
Van der Waals interactions, as these molecules cannot form hydrogen bonds and have rather 
small atomic partial charges. Methanol, acetic acid and urea are examples for moderately to 
strongly hydrogen bonded networks. 

For all six molecules, the same procedure was followed. First, possible crystal packings were 
generated by running Accelrys 5 s Polymorph Predictor [Accelrys 2004] twice with fine 
settings in the 17 most populated space groups. In five out of six cases, the Polymorph 
Predictor was used in conjunction with the CVFF95 [Accelrys 2004] force field which is well 
parameterized for many small molecules. However, CVFF95 turned out to be a poor choice 
for the crystal structure optimization of acetylene, and the Universial Force Field with Qeq 
charges [Accelrys 2004] was used instead. In a second step, the lattice energy of the 10-15 
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most stable crystal structures from the output of the Polymorph Predictor was re-minimized 
using the hybrid method and the lattice energy minimizer described in this document. 

Before we present a summary of the results obtained for all six molecules, we 
further illustrate the approach by a more detailed discussion of the case of acetylene. Fig. 12 
compares the energy ranking of the Polymorph Predictor to the energy ranking obtained with 
the hybrid method for the 15 most stable crystal structures found with the Polymorph 
Predictor (structures 1 1 — 15 are very similar in energy and appear as a single line). Several 
crystal structures proposed by the Polymorph Predictor minimize towards the same crystal 
structure for the hybrid method. Using the hybrid method, the low temperature crystal 
structure of acetylene if found as the most stable crystal structure (rank 1) and the polymorph 
observed experimentally at 143 K and above is found as rank 3. The intermediate crystal 
structure (rank 2) is closely related to rank 1. Both crystal structures consist of identical planar 
arrangements of molecules stacked in a similar but slightly different way. The crystal 
structures of rank 1, rank 2 and rank 3 are shown in Fig. 13 together with the two 
corresponding experimental crystal structures. The hybrid method accurately reproduces the 
known polymorphs of acetylene and their relative order of stability. Fig. 13 shows the three 
most stable crystal structures of acetylene found with the hybrid method according to the 
invention; for rank 1 and rank 3, a superposition with the experimental crystal structure is 
v , presented. 

The excellent results obtained for acetylene are representative for the overall performance of 
the hybrid method. The energy ranking for all six molecules of the test set is summarized in 
table 5.6.a. The first three columns show the name of the compound, the temperature at which 
, the low temperature crystal structure was determined and the name of the force field used in 
conjunction with the Polymorph Predictor. The forth column indicates the rank, in the output 
of the Polymorph Predictor, of the crystal structure that corresponds to the experimental low 
temperature structure. The good performance of the CVFF95 force field (rank 1 or 2 in all 
cases) is somewhat surprising. It illustrates that a force field, if well parameterized for a 
certain class of molecules, can be appropriate for in silico polymorph screening. However, 
force fields only work for a limited number of molecules, as illustrated by the fact that 
C VFF95 fails completely for acetylene. Tests for other small molecules (butane, paracetamol, 
etc) have shown that the Polyporph Predictor in conjunction with the CVFF95 force field 
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typically finds the experimental crystal structure among the first 10 crystal structures. The 
performance observed for the test set is thus significantly better than the average performance. 

Columns 5 in Tab. 5.6.a present the rank of the experimental crystal structure obtained after 
energy minimization with the hybrid method. Column 6 indicates the energy difference 
between the experimental crystal structure and rank 1. In 5 out of six cases, the experimental 
crystal structure corresponds to the most stable crystal structure, ethane being the only 
exception. Concerning the case of ethane, it has to be taken into account that the crystal 
structure of ethane was determined at 85 K. At this temperature, entropy effects are not 
negligible and it may well be that the most stable calculated crystal structure corresponds 
indeed the most stable crystal structure at 0 K 3 but not at 85 K. In addition it is important to 
note that the numerical accuracy of the DFT part of the calculation (see section 5.2) is only 
guaranteed to be better than 0.01 kcal/mol, thus being of the same order of magnitude as the 
energy difference between rank 1 and rank 3. Column 7 in Tab. 5.6.a indicates the number of 
different crystal structures found in a small energy window of 0.02 kcal/mol above the most 
stable crystal structure. Typically, the energy window contains more than one crystal structure 
- a fact with demonstrates that an accuracy of better than 0.01 kcal/mol is indeed required for 
the accurate energy ranking of polymorphic crystal structures. 



Tab. 5.6.a: Energy ranking obtained for 6 small molecules with the Polymorph Predictor and 
with the hybrid method (see text for details). 



Compound 


T-exp 

[K] 


Force 
field 


FF 
rank 


Hybrid 
rank 


Hybrid 
energy 
[kcal/mol] 


Nb in window 
0.00-0.02 
Tkcal/mol] 


Acetylene 


15 


UFF 


12 


1 


0.0 


1 


Ethylene 


85 


CVFF95 


1 


1 


0.0 


3 


Ethane 


85 


CVFF95 


1 


3 


0.009 


3 


Methanol 


15 


CVFF95 


1 


1 


0.0 


2 


Acetic acid 


4 


CVFF95 


2 


1 


0.0 


1 


Urea 


12 


CVFF95 


2 


1 


0.0 


2 



The results obtained with the Polymorph Predictor and some of the figures shown in this 
section were kindly provided by Aventis (France) as part of a collaboration in the field of in 
silico polynaorph screening. 
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In summary, it can be said that the hybrid method, with the empirical parameters derived in 
section 5.5.4, offers an unprecedented accuracy for the energy ranking of crystal structures. 

5.7 The polymorphism of N2 — an interesting test case 

Molecular nitrogen is an ideal test case to assess the accuracy of the hybrid method. On the 
one hand, the number of empirical parameters is extremely small. Indeed, we only refine the 
damping radius r^N, using the coefficient from Tab. 5.5.3.a and setting n = 1.0 for the 
form factor. On the other hand, a significant amount of experimental information is readily 
available [Mills et al 1986]. At low temperature, three ordered polymorphs have been 
observed. The oc-polymorph is the most stable structures at normal pressure. With increasing 
pressure, a first phase transition from the a-polymorph to the y-polymorph is observed at 
about 330 MPa. A second phase transition from the y-polymorph to the s-polymorph follows 
at about 1.9 GPa. The crystal structures of the three polymorphs have been determined 
experimentally at 20K and normal pressure for the a-polymorph, at 20K and 407MPa for the 
Y-polymorph and at 11 OK and 7.8 GPa for the s-polymorph. The availability of both structural 
information and energetic information (pressure dependent order of stability) provides a 
stringent test for the hybrid method. 

Using the refinement procedure described in 5.5, we first adjust the damping radius r N ,N such 
that the hybrid method reproduces the experimental crystal structures of oc-N 2 and y-N 2 . A 
value of rNjvj = 2,76 A is obtained. Fig. 14 compares the calculated to the experimental crystal 
structures for a variety of cases. Fig. 14 shows a comparison of the experimental and 
calculated crystal structures of a-N 2 (left) and y-N 2 (right) for three different calculations, 
each of the 6 graphics being a superposition of two crystal structures. The upper two crystal 
structures were obtained using the DFT method described in 5.2 without any additional 
empirical potentials. The two calculated unit cells are significantly larger than the two 
experimental unit cells, reflecting the fact the attractive long range dispersive interactions are 
missing. For the calculation of the next two crystal structures, the DFT calculations were 
combined with the empirical potentials proposed by Wu and Yang [Wu and Yang 2002]. In 
both cases, the calculated unit cells are smaller than the experimental unit cells, illustrating 
the fact that the empirical N-N potential of Wu and Yang overestimates the strengths of the 
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empirical corrections at short and intermediate interatomic distances. The lower two crystal 
structures were obtained with the hybrid method described in this work after parameter 
refinement. The agreement is significantly better than in the first two cases. 

Having adjusted the damping radius tn,n to the crystal structures of a-N2 and 7-N 2? we now 
examine how well the parameterized hybrid method performs with respect to the remaining 
experimental information. Fig. 15 shows a superposition of the calculated and the 
experimental crystal structure of e- N 2 . As for 0C-N2 and y-N2 ? the agreement between the 
calculated and the experimental crystal structure is excellent. 

We now turn our attention the relative stability of the three different polymorphs as a function 
of pressure. Lattice enthalpies were calculated for all three polymorphs at 0 MPa, 200 MP a, 
407 MPa, 4 GPa and 7.8 GPa. The results are shown in Fig 1 6. Since the enthalpy changes as 
a function of pressure are significantly larger than the relative enthalpy changes, Fig. 16 
shows relative enthalpies rather than absolute enthalpies. For a given pressure, the average 
enthalpy of the three polymorphs is set to zero. Qualitatively, the experimental order of 
stability is well reproduced. Y-N2 is the most stable crystal structure in the intermediate 
pressure range. At high pressure, 8- N2 is more stable than y-TSb. At low temperature, enthalpy 
of 0C-N2 approaches the enthalpy of y-N 2 - Quantitatively, however, there are some 
disagreements. For the two phase transitions, the calculation yields pressures of about 0 MPa 
and 6.5 GPa, whereas values of 330 MPa and 1.9 GPa have been observed experimentally. 
These disagreements are not necessary related to limitations of the hybrid method. N2 is very 
small (light) molecule, and zero-point translations and rotations can be expected to have a 
significant effect on the relative enthalpies. It can be estimated that zero point effects, 
currently neglected in the calculation, may change the relative stabilities by about 0.05 
kcal/mol. Comparing this value to the energy scale of Fig 16 it is obvious that the current 
neglect of zero-point effects may well be at the origin of the observed mismatch between the 
calculated and the experimental transition pressures. 

The case of molecular nitrogen is an excellent example to illustrate the strong correlation 
between the damping radii and the Cs coefficients that has already been mentioned in section 
5.5. The parameter refinement described in this section was actually carried out for two 
different values of the C 6 coefficient. The first two lines of Tab. 5.7.a show the two values of 



-71- 



WO 2005/091178 



PCT/EP2005/050778 



the C6 coefficient and the resulting values for the damping radius. Lines 3 to 5 show the 
deviations between the calculated and the experimental crystal structures for both cases. The 
last two lines present the enthalpy differences calculated for the two pressures at which the 
phase transitions occur. Ideally, these enthalpy differences should be zero. For the first C6 
coefficient, the parameter refinement was carried out several times with different starting 
points to obtain an estimate of the numerical accuracy of the fitting procedure. The 
corresponding error bars are indicated in Tab. 5.7.a. For the two different parameters, we 
obtain different values of the damping radii, but very similar agreement with the experimental 
data. Both the structural deviations and the enthalpy differences are virtually the same to 
within the calculated error bars. 



Tab 5.7.a: Refinement of the damping radius r for two different values of the C 6 coefficient 
(see text for details). 



L ;6 jA6 kcal{mo t %, ■ - y .r!p* 




aHflHHHNHBHl 






; ;^ 1 :.3.65; ; : ;: ' --^v:/. ][ 


Deformation a-phase [%] 


0.30±0.25 


0.19 


Deformation y-phase [%] 


1.20±0.26 


1.54 


Deformation e-phase [%] 


2.35±0.20 


1.98 


Ea-Ey (330 MPa) [kcal/mol] 


0.034±0.003 


0.032 


Ey-Ee (1.9 GPa) [kcal/mol] 


-0.066±0.001 


-0.066 
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